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Corr*l.tioa  Algor itha  Report 


1. 


1.1.  Purpose 


CL^ 


This  report#  describes  the  findings  of  the  Algorithm  Analysis  Subtask 
group  working  on  the  U.S.  Army  Intelligence  Center  and  School  (TJ&AICS)  Software 
Analysis  and  Management  System  task  (USAMS)  regarding  Electronic  Intelligence 
(ELINT)  correlation  algorithms  used  in  five  of  the  intelligence-gather ing 
systems  under  USAICS  cognizance.  The  statistical  mathematics  on  which  the 
algorithms  are  based  is  examined  with  particular  reference  to  assumptions. 
Individual  algorithms  are  analyzed  to  determine  whether  they  are  performing 
their  functions  properly.  Algorithms  that  perform  the  same  function  in  differ¬ 
ent  systems  are  compared  to  determine  which  ones  are  best  according  to  various 
criteria.  _ _ _ 


The  algorithms  examined  in  this  report  are  taken  from  the  BETA,  TCATA 
ELINT,  ITEP,  QUICILOOI,  and  AGTELIS  systems.  They  were  chosen  from  the  more 
than  40  deployed  intelligence  systems  for  which  USAICS  is  Combat  Developer 
because  some  documentation  was  available  and  because  they  represented  a  range 
of  ELINT  applications.  The  ELINT  correlation  algorithms  have  been  chosen  since 
they  are  most  nearly  automatic,  that  is,  require  the  least  operator  interven¬ 
tion  and  rely  on  technical  parameters  most  amenable  to  statistical  techniques. 


1.2.  Background 


Eaoh  of  the  more  than  40  intelligence  systems  under  USAICS  cognizance 
employs  several  types  of  algorithms  to  carry  out  its  gathering  and  processing 
of  intelligence  data.  Two  important  types  of  these  algorithms,  geographic 
transformation  and  correlation,  have  been  chosen  for  analysis  during  this  year. 
The  former  translates  grid-zone  locations,  for  example,  from  latitude-longitude 
to  Universal  Transverse  Mercator  (UTM),  while  the  latter  resolves  many  indivi¬ 
dual  sitings  into  militarily  recognizable  targets  and  situation  reports  based 
chiefly  on  standard  statistical  procedures.  It  is  important  to  develop  a  set 

•A  report  on  geographical  transformation  algorithms  has  been  submitted  in  FT82 
and  a  report  on  possible  algorithm  analysis  methodologies  is  scheduled  for 
FT83. 
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of  parameters  to  characterise  these  algorithms  to  determine  how  they  should  ho 
catalogued.  Whoa  thoso  activities  aro  completed,  it  becomes  possiblo  to  compare 
algorithas  that  perform  tho  same  function  la  difforeat  systoas  aad  finally  to 
develop  improved  algorithas  that  porfora  tho  function. 

For  this  roport  tho  JPL  Algor  itha  Analysis  Sab  task  group  has  oxaaiaod 
ELINT  radar  oorrolatioa  algorithas  for  five  of  tho  systoas  uador  USAICS  cogni¬ 
zance,  aaaoly  Battlof  iold  Exploitation  aad  Targot  Acquisition  (BETA)*  TSADOC 
Coabiaod  Aras  Traiaiag  Activity  (TCATA)  ELINT  Processor,  Intoria  Tactical 
ELINT  Processor  (ITEP),  Airborne  Non-Communication  Eaitter  Location  aad  Iden¬ 
tification  Systea  (Quicklook),  aad  Autoaated  Ground  Transportable  Eaitter  Loca¬ 
tion  and  Identification  Systea  (AGTELIS).  BETA  is  a  Test  Bed  prograa  for 
correlating  data  received  froa  several  types  of  sensor  systeas  and  asking 
target  nominations.  Both  autoaatio  correlation  and  aggregation  techniques  and 
interactive  graphics  are  used  in  the  operator's  analysis.  The  TCATA  ELINT 
Processor  and  ITEP  are  sinilar  data  analysis  tools  that  integrate  many  sitings 
into  various  intelligence  reports.  AGTELIS  and  Quicklook  are  both  collection 
and  analysis  systems*,  as  collection  systems  they  do  not  integrate  data  from  as 
wide  a  range  of  sensor  systems  as  do  the  others.  These  systems  would  generally 
be  employed  at  Brigade  through  Corps  level  or  at  an  Air  Force  Tactical  Air 
Control  Element  (TACE)  or  Allied  Tactical  Air  Force  (ATAFh  target  nominations 
and  tactical  situation  reports  would  be  available  to  commanders  and  their 
staffs  from  Brigade  through  Echelons  Above  Corps  (EAC). 

USAICS  has  cognizance  of  a  large  number  of  algorithms  integral  to 
intelligenoe-gathering  systems  in  various  stages  of  development  and  deployment. 
The  state  of  "deployment"  of  algorithms  in  the  USAICS  inventory  ranges  from 
that  of  products  of  research  contracts  not  yet  implemented  in  any  system  to 
those  fielded  systems  such  as  Quicklook.  In  the  latter  systems  the  algo¬ 
rithms  are  documented  in  design  documents  (narrative  English  and  equations), 
and/or  in  machine  readable  design  language,  and  in  code.  Often  not  all  of 
these  forms  of  documentation  are  available  for  any  one  system.  For  research 
algorithms  not  yet  implemented,  actual  code,  or  even  detailed  flow  charts,  may 
not  be  available,  aad  analysis  must  rely  solely  on  mathematical  descriptions. 

"Algorithms"  will  mean  any  set  of  rules  for  carrying  out  a  single 
conceptual  operation  on  a  set  of  data,  such  as  transforming  from  latitude  aad 
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longitude  to  UTM  coordinates  or  determining  i  position  from  i  fttttbir  of  direc¬ 
tion  aeuutMftti  taken  at  toovn  point!*11  Algorithms  are  of  toft  hierarchical, 
lover— levei  algorithms  hoimg  uaod  to  doaoribo  higher-level  algorithms  aad 
thereby  illftmiaatiftg  tha  underlying  logioal  structure*  That  results  from  oao 
algorithm  may  bo  data  for  another*  This  occurs  extensively  for  tha  oorrslatioa 
algorithms,  aad  oorroetly  idoatifyiag  the  as  samp  t  ions  mads  in  linking  tha  hier- 
arohical  larols  is  critical.  USAICS  is  intarastad  la  algorithms  parformiag 
iatslligsncs  data  proeassiag  functions  oaatral  to  its  systems*  missioas  aad 
thosa  parformiag  crucial  support  fuaotions  common  to  a  number  of  systems  such 
as  geographic  location*  Data  management  or  mathematical  function  algorithms, 
although  vital  to  the  efficient  functioning  of  the  systems,  are  not  being 
treated  in  these  first  algorithm  analyses* 

1*3*  User  Benefits 

These  analyses  can  benefit  users  in  several  ways*  First,  a  catalogue 
of  existing  algorithms  will  help  USAICS  avoid  having  algorithms  redeveloped  for 
new  systems  from  first  principles*  Second,  analysis  of  individual  algorithms 
may,  in  a  few  cases,  identify  deficiencies  worth  correcting  on  the  next  system 
revision*  Third,  and  most  important,  the  comparison  of  algorithms  performing 
the  same  function  in  different  systems  can  lead  to  identifying  guidelines  for 
developing  and/or  selecting  algorithms  to  include  in  a*w  and  revised  systems* 
Seleoted  algorithms  from  the  systems  studied  will  begin  to  form  a  library  of 
intelligence  algorithms  with  associated  computer  subroutines  that  will  be 
analogous  to  the  Collected  Algorithms  of  the  Association  for  Computing  Machin¬ 
ery  (ACM).  The  creation  of  such  a  library  is  in  the  spirit  of  Ada+,  the 
Department  of  Defense  language  for  embedded  systems,  and  the  Ada  environment* 

1,4,  High-Level  Schematics 

There  are  several  steps  in  identifying  enemy  locations  using  electro¬ 
nic  intelligence  methods*  These  steps  are  arranged  in  a  hierarchy  beginning  at 
the  bottom  with  ooserving  aad  estimating  emitter  characteristics  and  moving 

^These  conceptual  models  can  be  presented  simply  aad  logically,  but  the  pre¬ 
sentation  of  their  technical  implementation  is  often  significantly  more  compli¬ 
cated  to  present* 

+Ada  is  a  trademark  of  the  Department  of  Defense, 


tk«o«|k  sioensiTi  lmlt  of  data  integration.  Oftaa  oaa  of  several  aisup* 
tioaa  concerning  the  behavior  of  these  observations  oaa  bo  oboooa  vboa  moving 
froa  oaa  level  of  tba  hierarchy  to  tba  aaxt.  How  aacb  level  it  aodalad  aad 
vbat  analytical  techniques  ara  obosaa  depend  oa  tbaaa  assumptions.  Lata:  in 
tbia  report  tba  aaaaaptioaa  babiad  a  oaa  developed  ayataaa  will  ba  discussed. 
Tbia  aaotioa  iatrodnoaa  a  fraaavork  for  tba  aatira  hierarchy*  providing  a 
ooatazt  for  tba  aora  datailad  aad  taebaieal  diaeaaaiona  to  follow. 

Tbara  ara  foar  atagaa  ia  tba  aatoaatio  processing  of  ELINT  data  as 
iaplaaaatad  in  aost  currant  systems:  collection*  separation*  self-correlation, 
aad  eross-oorrelation.  Tbaaa  ara  illuatrat ad  scbeaatioally  in  Figures  1-1  and 
1-2. 

Tbe  first  step  in  this  "automatic"  radar  target  acquisition  and 
noaiaation  process*  called  oollectioa*  gathers  lines-of-bearing  and  signal 
paraaatriea  associated  with  different  emitters.  Examples  of  signal  paraaetrics 
normally  collected  are  radar  frequency*  pulse  repetition  interval*  and  pulse 
width.  During  collection*  error  in  tbe  estimates  of  enemy  radar  location  and 
signal  paraaetrics  eaters*  primarily  through  measurement  error.  Understanding 
this  error  ia  vital  to  understanding  sc lf-oor relation.  Tba  different  assump¬ 
tions  about  tba  behavior  of  this  error  made  will  be  discussed  in  detail*  but 
the  aotual  mathematics  of  collection  will  be  deferred  to  a  later  report. 

The  second  step*  called  separation*  identifies  which  observations  coma 
from  which  emitters.  A  subset  of  observations  is  thus  identified  for  each 
emitter.  Bach  subset  is  a  sample  from  the  population  of  all  possible  observa¬ 
tions  of  that  radar  by  a  sensor  system.  If  the  sensors  are  unbiased*  they  will 
gather  samples  whose  averages  will  estimate  tbe  true  radar  characteristics. 
(Because  tba  sensors  are  unbiased*  tbe  true  radar  characteristics  and  the  mean 
of  all  possible  observations  are  identical).  The  sample  variance  can  be  used 
to  estimate  tbe  measurement  error.  Separating  one  radar's  sample  out  of  the 
collected  observations  and  determining  tbe  mean  and  elliptical  error  probable 
(EBP)  is  usually  called  determining  tbe  "fix”  for  a  radar.  How  well  tbe 
observations  are  separated  according  to  their  populations  depends  on  tbe  den¬ 
sity  of  tbe  radars*  tbe  accuracy  of  tbe  sensor  system  for  both  locations  and 
signal  paraaetrics*  aad  the  statistical  techniques  chosen.  Examples  of  statis¬ 
tical  techniques  aed  are  jackknifing  and  sequential  searching. 
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Jackknifing  determines  if  a  sab  sat  of  observations  is  internally 
consistent  by  evaluating  bow  closely  ssob  of  tbs  ob serrations  can  bs  estimated 
using  tbs  rest  of  tbs  observations.  To  do  this,  for  each  observation  z  tbs 
location  sstinats  of  tbs  remaining  obssrrations  is  oaloulatsd  and  tbs  differ- 
snos  bstwssn  its  line-of-b  earing  and  that  of  z  is  determined.  If  for  srsry  z 
in  tbs  subset  this  difference  is  less  than  a  prespecified  value,  the  subset  is 
consistent  and  considered  as  coning  fron  tbs  sans  emitter. 

Sequential  searching  usually  involves  taking  all  observations  falling 
within  some  distance  of,  or  somehow  "clustering”  with,  another  observation 
and  determining  if  they  are  dense  enough  to  be  considered  as  coming  from  one 
emitter.  This  density  may  be  determined  by  calculating  the  variance  from  the 
best  location  estimate.  This  process  is  then  repeated  for  other  points.  Spec¬ 
ial  rules  for  computing  any  given  f  iz  are  usually  applied  to  ensure  observa¬ 
tions  are  made  from  different  locations. 

Although  no  new  random  error  is  introduced  in  separation,  other  error 
is  introduced  if  separation  is  not  done  correctly.  Both  jackknifing  and 
sequential  searching  may  fail  under  reasonable  conditions,  in  which  case  emit¬ 
ter  locations  or  signal  parametrise  estimates  based  on  them  may  be  "phantoms" 
and  not  represent  ezisting  radar  characteristics.  Not  only  can  this  provide  a 
false  target,  but  it  will  simultaneously  hide  at  least  one  true  one.  Thus 
separation  shapes  much  of  the  input  to  the  self-correlation  phase,  and  will  be 
considered  in  its  own  right  in  another  report. 

The  third  step,  and  the  focus  of  this  report,  is  called  self-oorrela- 
tion.  Candidate  radars,  specified  by  either  directly  collected  lines-of- 
bearing  and  signal  parametrics  or  their  estimates  coming  from  separation,  are 
compared  with  "known”  radars.  If  the  new  information  seems  to  refer  to  a 
radar  already  identified,  it  is  used  to  refine  what  is  known  about  that  radar 
location.  Many  self-correlation  algorithms  also  try  to  determine  if  the  candi¬ 
date  represents  observations  of  a  "known"  radar  which  has  moved  or  one  which 
has  shifted  its  signal  parametrics  logically.  A  candidate  which  cannot  be 
associated  with  any  radar  in  the  database  is  considered  a  "new"  radar  and  is 
added  to  the  database.  In  this  manner  the  database  of  "known”  radars  is 
built*  the  first  candidate  is  new  by  default  (since  the  database  is  empty),  the 
second  is  either  successfully  associated  with  the  first  or  is  added  as  a  second 


database  entry,  and  so  on.  When  a  database  entry  is  snffieiently  refined,  or 
exported  by  enough  candidates,  it  can  become  a  target  nomination. 

Rules  for  associating  candidates  with  database  entries  vary  from 
system  to  system.  They  usually  assume  that  location  estimates  are  normally 
distributed  and  signal  parametries  either  fall  between  intelligence-specified 
bounds  or  hare  some  statistical  distribution  whose  parameters  can  be  estimated. 
All  systems  analyzed  to  this  point  seem  to  assume  that  all  measured  attributes 
are  independent.  Some  systems  use  a  final  "measure  of  correlation”  for 
association  which  combines  the  various  measures  using  subjective  weighting 
factors. 

The  final  step  in  this  intelligence  analysis,  called  oross-correla- 
tion,  identifies  more  complex  entities,  such  as  Division  Command  Posts  vith  the 
known  simple  entities  of  which  they  are  composed.  These  simple  entities  are 
provided  by  previous  processing.  Either  the  simple  or  the  complex  entity  may 
be  the  new  candidate.  The  measures  of  assoc iation  used  rely  on  information 
contained  in  templates.  These  templates,  based  on  intelligence  estimates, 
indicate  what  specific  complex  targets  should  look  like*  The  problems  arising 
in  cross-correlation  will  be  studied  in  another  report. 

Turning  now  to  a  slightly  more  detailed  view  of  self-correlation, 
figure  1-2  concentrates  on  its  interface  vith  separation,  illustrating  the 
mathematical  assumptions  made  to  tie  the  system  together.  These  assumptions 
prescribe  which  mathematical  techniques  can  be  used  for  building  the 
algorithms. 

Two  levels  of  assumptions  underlie  the  distributions  used  for  location 
and  signal  parametries  in  the  self-correlation  algorithms.  The  first  is  that 
all  observations  used  to  calculate  an  estimate  were  made  of  the  same  emitter. 
Ground  truth,  the  array  of  emitters  with  which  these  sensor  system  must  actual¬ 
ly  cope,  includes  multiple  emitters.  Thus,  either  the  separation  algorithm 
itself  or  the  operational  capability  of  the  collection  sensor  system  is  assumed 
to  classify  observations  accurately,  differentiating  those  oom  ing  fro.  separate 
eaitters. 

The  seooad  level  of  aasuaptiona  deals  vith  the  shape  of  the  distribu¬ 
tions  of  the  radar  eharaeteriaties  and  the  statistical  independence  of  the 
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o  auy  emitters,  eaoh  with  random  noise 


lupin*  OittA  the  nlidity  of  the  first  assumption,  the  nbiutd  noml  dis- 
tribstioft  for  location  estimates  can  bo  established  bj  classical  statistical 
arguments,  Distributions  for  the  various  signal  paraaotrics  apparently  are 
dotorainod  by  Bayesian  techniques  supported  by  intellig ence-eoaaunity  esti- 
aatosf  whore  the  tern  Bayesian  is  understood  in  the  popular  sense  of  relying  on 
predeternined  priors  (probability  estiaates)  which  nay  be  nodified  by  inooaing 
data,  (All  the  statistical  inference  found  in  the  systeas  surveyed  is  classi¬ 
cal),  The  ubiquitous  tacit  assnaption  of  independence  aaong  signal  paraaet- 
rics,  underlying  their  joint  distribution  and  dictating  the  appropriate  statis¬ 
tical  tests,  seen  dubious  at  best;  even  location  and  signal  paraaotrics  are 
probably  aade  statistically  dependent  by  taetios.  Dependence  aaong  the  various 
radar  characteristics  aeasured  or  inferred  will  be  considered  in  greater 
detail  in  later  sections,  with  particular  attention  to  possible  relationships 
with  tine  of  observation. 

The  consequences  of  these  assuaptions  will  be  discussed  for  each 
systea  in  Appendix  E. 

2,  SUIVIfr  OP  SBLHBMEUIMN  ALGOBDUS  UK  mSTlMP  S1S1BMB 


The  aatheaatical  docuaentation  of  several  of  the  ELINT  radar  self¬ 
correlation  algorithas  has  been  surveyed,  Self-oorrolation  algorithms  compare 
new  intercept  estiaates  (candidate  radars)  with  prior  estiaates  (known  radars) 
of  location,  descriptive  signal  paraaetrios,  and  time  of  observation.  If  a 
match  is  found,  the  new  information  aay  be  used  to  update  the  old;  if  none  is 
found,  the  new  estiaate  is  added  to  the  database  of  "known"  radars.  Some 
data  analysis  systeas  also  take  into  account  possible  movement  or  change  in 
operating  signal  paraaetrios.  Figure  2-1  illustrates  a  general  self-correla¬ 
tion  algorithm. 

Two  sets  of  tests  ooapare  new  estiaates  with  those  already  in  the 
database.  One  is  based  on  radar  location,  the  other  on  signal  paraaetrios, 
such  as  frequency,  pulse  width,  pulse  repetition  interval,  and  tine  of  observa¬ 
tion,  Most  systeas  assuae  that  location  estiaates  are  normally  distributed  and 
base  these  tests  on  standard  statistics.  Tests  based  on  signal  paraaetrios  are 
handled  by  several  statistical  and  non-statlstioal  techniques.  These  tests, 
and  their  assumptions,  will  be  discussed  in  the  reaainder  of  this  section.  The 
focus  will  be  on  stationary  radars:  those  that  change  neither  their  signal 
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pirmtrios  nor  thci  r  location;  and  tb  •action  will  conclude  by  discus  ting  s 
specific  example,  the  radar  self-correlation  algorithms  in  the  BETA  system. 
The  disonssion  will  be  limited  to  general  statistical  forms  and  conceptr,  those 
interested  in  the  underlying  mathematics  should  oonsult  the  annotated  reference 
list  (Appendix  A).  The  specific  algorithms  used  in  the  systems  surveyed  will 
be  considered  in  Appendix  E, 

As  Figure  2-1  indicates*  systems  use  both  sequential  and  simultaneous 
decision  tests*,  simultaneous  tests  can  be  based  on  either  a  joint  distribution 
or  a  (usually  linear)  combination  of  tests  of  individual  characteristics. 
Before  looking  at  the  mathematical  form  of  the  tests,  or  measurements,  for 
individual  radar  characteristics  (location  and  various  signal  parametrics),  it 
will  be  useful  to  consider  the  mathematical  implications  of  choosing  sequential 
or  simultaneous  tests. 

Sequential  tests  normally  assume  that  the  characteristics  being  tested 
are  statistically  independent.  If  this  assumption  is  true,  sequentially  test¬ 
ing  hypotheses  based  on  the  one -dimensional  marginal  distributions  vill  even¬ 
tually  lead  to  rejection,  if  there  are  enough  dimensions,  whereas  "averaging" 
the  noise  over  all  the  dimensions  in  a  test  based  on  the  joint  distribution  may 
result  in  acceptance.  If  the  characteristics  are  correlated,  however,  a  pro¬ 
perly  constructed  test  based  on  the  joint  distribution  approaches  one  dimen¬ 
sionality,  making  the  sequential  approach  redundantly  test  the  same  thing,  so 
that  sequential  and  simultaneous  results  are  the  same.  Where  some  charac¬ 
teristics  are  correlated  and  some  are  not,  which  usually  implies  nonlinearity, 
the  outoomes  using  simultaneous  or  sequential  testing  are  hard  to  compare, 
unless  both  tests  are  carefully  constructed  to  reflect  the  same  behavior. 
Further,  by  using  the  joint  distribution,  behavior  of  individual  characteris¬ 
tics  is  obscured.  One  way  to  retain  some  control  over  the  influence  exerted  by 
individual  characteristics  on  the  outcome  of  the  test  and  to  provide  the  flexi¬ 
bility  of  easilly  performing  a  set  of  tests  sequentially  or  simultaneously  is 
to  use  the  weighted  sum  of  the  statistics  for  each  characteristic.  This 
approach,  and  some  of  its  possible  statistical  interpretations  and  derivations, 
will  be  considered  in  the  section  on  simultaneous  tests.  The  statistical  basis 
for  these  simultaneous  tests,  or  lack  of  it,  will  beoome  particularly  important 
when  diseussing  non-stationary  radars. 
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2*1*  Locution  TMt«  for  Stationary  Radars 

TRa  initial*  often  tacit*  at  snap  t  ion  nade  is  that  each  location  eati- 
aatc  coact  froa  a  set  of  observations  of  tbe  saae  radar.  The  consequences  of 
relaxing  tbit  assumption  will  be  discussed  later  in  tbis  report. 

Most  of  tbe  systems  surveyed  nse  a  chi-square  statistic  for  location 
tests*  either  as  an  individual  hypothesis  test  or  as  part  of  a  sianltaneous 
test.  Tbe  choice  is  based  on  established  statistics*  a  brief  heuristic  discus¬ 
sion  of  which  follows. 

Figure  2-2  shows  schematically  the  observational  data  used  to  deter¬ 
mine  tbe  location  estimate  X  and  elliptical  error  probable.  Different  platforms 
belonging  to  tbe  same  sensor  system  tabs  several  lines-of-bearing  to  tbe  radar* 
leading  to  measurement  error  in  the  angle  specifying  the  line-of-b earing  and* 
especially  in  airborne  platforms*  also  in  the  platform  looation.  These  are 
translated  into  error  in  tbe  looation  estimates  which  are  assumed  to  have  a 
bivariate  normal  distribution  with  mean  x  and  covariance  matrix  $•  The  level 
curves  of  tbis  joint  distribution  are  ellipses  (Figure  2-3)  and  not  circles 
chiefly  because  tbe  total  angle  of  observation  is  small*  could  tbe  sensors 
surround  tbe  radar*  tbe  level  curves  would  most  likely  become  circles. 

Now  letting  and  *2  be  tbe  means*  end  Sj  tbe  known  population 
covariance  matrices*  and  nj  and  n^  ^e  number  of  observations  in  two  samples* 
tbe  statistic 


X2-  <V*2)T(nl1V'  n2ls2)(v  V 

has  a  non-eentral  ohi-aquare  distribution  with  non-centrality  parameter 


L->  >  s  6  6 
ij  i  J 

.here  6i  1*  the  differ.noe  in  mean a  for  th.  ith  characteristic  and  are  the 
elements  of  S-nfls^  +  nj1^  inverse.  If  the  hypothesis  that  the  two  samples 
refer  to  the  same  radar  is  trne,  then  6j“0  for  i">l,2  (of.  Johnson  and  Leone, 
section  17.7). 
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Consider  the  model  of  the  observation  of  a  single  characteristic  nade 
by  tha  kth  sensor  sys ten. 

xk(t)ii+bk<t)  +  sk(t) 

▼here 

xk(t)  is  the  sample  mean  (estimated  location)  sampled  at  time  t 

p  is  the  true  location  of  the  radar 

bk(t)  is  the  location  bias  introduced  by  that  sensor  system  at 

time  t  and 

«k(t)  is  the  error,  usually  assumed  distributed  N(0,1)~  normal 
with  xero  mean  and  unit  variance 

If  "belonging  to  the  same  population91  means  "observing  the  same  radar"  - 
and  that  is  what  self-correlation  is  testing  -  then  the  non-centrality  parame¬ 
ter  is  not  zero,  if  different  sensor  systems  introduce  different  biases,  even 
though  they  are  observing  the  same  radar.  Thus,  for  the  statistic  X^  to  have  a 
central  chi-square  distribution,  as  is  usually  assumed,  the  observations  must 
be  unbiased,  or  all  have  the  same  bias.  Further,  the  error  must  be  invariant 
with  respect  to  time.  Although  this  may  initially  seem  to  be  a  harmless 
assumption,  error  probably  does  depend  on  range  which  changes  with  time. 
Potential  dependence  of  measurement  error  on  distance  to  the  emitter  usually 
can  be  accounted  for,  and  in  no  case  should  be  overlooked. 

Two  different  methods  of  handling  the  location  tests  were  found  in  the 
systems  surveyed.  The  first,  part  of  a  sequence  of  tests,  was  a  central  chi- 
square  test  for  a  predetermined  confidence  level  (1-a)  using  the  statistic  X*. 
The  second  involved  simultaneous  tests  with  signal  parametrios,  needed  to 
convert  the  chi-square  statistic  value  into  a  value  compatible  with  the  oth0r 
tests,  in  particular,  to  map  it  into  [0,1]*,  the  functions  used  for  this  napping 
or  transformation  were  usually  of  the  form 


where  x  is  the  chi-squared  statistic.  As  it  is  the  "quadratic  fora”  for  beta 
equaled  to  2,  the  above  equation  gives  the  unuoraalized  probability  density  for 
the  bivariate  normal  distribution  and  is  the  standard  mapping  of  a  chi-squared 
distribution  onto  the  unit  interval.  Either  of  these  approaches  is  reasonable 
as  long  as  the  estimates  are  unbiased*  the  samples  statistically  independent* 
and  the  distributions  normal*  and  normality  is  a  standard  assumption  for  dis¬ 
tribution  of  sample  means, 

2.2.  Signal  Parana  tries  Tests  for  Stationary  Radars 

As  stated  above*  s  stationary  radar  is  not  only  one  that  is  not  mobile 
but  also  has  no  basic  changes  in  its  operating  signal  parametrics.  Although 
tests  on  time  of  observation  usually  appear  with  these  signal  parametrics  their 
discussion  will  be  postponed  to  the  section  on  n on-stationary  emitters.  Unlike 
those  for  location*  there  is  no  veil-developed  statistical  literature  for  these 
characteristics.  The  three  main  approaches  to  hypothesis  testing  found  in  the 
systems  surveyed  were: 

1)  test  whether  the  new  characteristic  estimate  is  within  preestab¬ 
lished  limits  or  pre established  bounds  of  the  known  estimate** 

2)  calculate  a  value  (to  be  used  in  a  simultaneous  test)  using  a 
simple  function  of  the  difference  between  new  and  known  charac¬ 
teristic  estimator* 

3)  calculate  individual  measures  of  correlation  for  each  character¬ 
istic  (to  be  used  in  a  simultaneous  test)  that  reflect  the  proba¬ 
bly  that  the  new  estimate  falls  within  the  (1-a)  confidence  band 
of  the  known  estimate. 

These  three  approaches  will  be  considered  separately. 

Tests  on  whether  the  new  estimate  falls  within  a  given  interval  are  of 
two  types. 

1)  Those  for  which  all  known  operating  intervals  of  enemy  radars  are 
predetermined*  so  for  two  observations  to  be  of  the  same  radar* 
they  must  both  lie  in  the  same  interval* 
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2)  Those  for  which  the  variability  about  a  given  signal  pa  rase  trio 
for  enemy  radars  is  predetermined,  so  the  new  estiaate  must  lie 
within  that  distanoe  of  the  known  one. 

The  first  type  has  no  statistical  content*,  it  is  a  deterainistio 
decision  based  on  certain  prior  knowledge.  The  second  lends  itself  to  a  stat¬ 
istical  interpretation,  especially  if  real-tine  data  is  used  to  aodify  the 
bounds,  leading  to  a  "Bayesian"  approach.  Neither  of  these  need  deal  with 
the  classical  statistics  arising  froa  errors  in  measurement.  The  second  tech¬ 
nique  aay  take  into  account  priors  arising  froa  measurement  methods  as  well  as 
radar  performance,  but  that  can  not  be  determined  by  the  mathematical  form  of 
the  test  alone.  Such  motivations  are  known  only  to  the  creator  of  the  "data¬ 
base"  of  predetermined  bounds. 

The  second  class  of  tests,  those  using  a  simple  function  of  the  diffe¬ 
rence  between  new  and  known  estimates,  are  designed  to  be  used  in  simultaneous 
tests,  considering  several  signal  parametrics  with  or  without  location.  The 
functions  are  used  to  emphasise  differences  between  estimates  in  certain 
ranges,  to  map  the  differences  into  [0,1]  so  they  oan  be  combined,  or  to 
reflect  some  assumed  distribution  for  that  characteristic.  These  distributions 
may  reflect  variability  in  enemy  radar  performance  or  friendly  sensor 
observations. 

The  third  approach  assumes  some  distribution  for  th3  estimates,  usual¬ 
ly  normal.  It  also  assumes  that  the  (1-0)  confidence  intervals  for  the  new  and 
known  estimates  are  given  (they  are  not  given  in  the  unclassified  portion  of 
the  TACELINT  message  format  given  in  the  referenced  technical  directive).  The 
measure  of  correlation  is 


known) 


*~X 


new' 


where  (1-a  &ev)  indicates  the  closed  1-a  confidence  interval  for  the  new  esti¬ 
mate  and  (1-a  known^  ^  known  estimate.  Given  the  above  assumptions  and 
statistical  independence  of  measurement,  this  equation  gives  the  conditional 
probability  of  an  estimate  coming  from  the  known  population,  given  it  is  known 
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to  oome  fro®  the  new  population,  lienee  a  probability  that  the  estimates  come 
fro®  tbe  same  population.  Unlike  tke  teats  discussed  earlier  that  rest  heavily 
on  priors,  this  test  is  based  on  the  variability  arising  in  sampling  (that  is, 
in  obserring  enemy  radars).  If  there  is  variability  in  the  enemy  radar  para¬ 
metric  around  some  fixed  value,  this  variability  vill  be  reflected  (interming¬ 
led  with  variability  arising  from  the  measurement  technique)  in  this  measure  of 
correlation.  To  separate  these  two  contributions  to  the  variance  requires 
other  statistical  techniques,  and  the  relevance  of  the  information  obtained  to 
this  intelligence  problem  it  unclear. 

2.3.  Simultaneous  Tests  for  Stationary  Radars 

Many  of  the  values  discussed  above  were  designed  to  be  used  in  simul¬ 
taneous  tests.  Two  forms  of  these  tests  will  be  considered: 

1)  the  weighted  (oonvex)  sum  of  values  for  individual  characteris¬ 
tics,  and 

2)  a  cumulative  point  test. 

These  are  indeed  the  same  kind  of  test,  cumulative  point  tests  being  a  discrete 
version  of  weighted  sums.  Cumulative  point  tests  use  a  predetermined  set  of 
values  for  eaoh  characteristic*  among  which  of  these  values  the  test  value 
falls  (often  the  difference  between  estimates)  determines  the  number  of  points 
assigned  for  that  characteristic.  These  points  are  summed,  and  the  decision  to 
accept  the  new  and  known  estimates  as  referring  to  the  same  radar  is  based  on 
that  sum.  Determining  the  oriteria  for  assigning  points  is  similar  to  deter¬ 
mining  which  functions  will  map  the  difference  in  estimates  (for  signal  para- 
metrios)  or  the  chi-square  value  (for  location)  into  [0,1]  and  to  determining 
the  set  of  weights. 

The  weighted  linear  sum  is  often  known  in  statistical  literature  as  a 
"linear  discriminant*9.  Weights  and  mapping  functions  are  chosen  to  enhanoe 
its  ability  to  distinguish  between  populations  based  on  the  characteristics 
being  measured.  Without  any  assumptions  on  the  statistical  properties  of  the 
characteristics,  this  is  an  often  useful  tool  for  constructing  a  hyperplane 
separating  different  sets  of  observations  (those  from,  it  is  hoped,  different 
radars)  in  N  dimensional  space,  where  N  is  the  number  of  independent  charae- 


teristios.  Hot#  that  dependence  between  characteristics  ia  reflected  by  fewer 
dimensions  of  the  hyperplane  than  there  are  observed  characteristics,  and  the 
weights  assigned  to  dependent  parameters  control  their  contribution  to  the 
independent  dimension  they  define. 

To  make  statistical  statements  based  on  this  disoriminant,  further 
assumptions  about  the  characteristics  are  required.  The  classical  assumptions 
are  that  the  characteristics  are  normally  distributed  (true  for  most  mean 
estimates  based  on  large  samples),  are  independent  (this  will  be  addressed 
later),  and  have  equal  oovariance  matrices.  Under  these  assumptions  a  linear 
discriminator  with  usually  calculable  coefficients  can  be  obtained,  whose 
statistical  behavior  is  known. 

One  observation  on  the  functions  applied  to  the  individual  character¬ 
istic  differences  is  that  these  functions  are  the  vehicle  to  carry  distribu¬ 
tional  information  about  the  characteristics.  The  functions  chosen  in  the 
systems  surveyed  are  almost  invariably  those  classically  used  to  represent 
ignorance,  not  prior  knowledge. 

Finally,  note  that  even  from  this  general  view  of  their  form,  signal 
parametrics  tests  are  seen  to 

1)  discriminate  between  types  of  radars,  but  probably  not  individual 
units  unless  true  tests  are  very  precise,  and 

2)  support  decisions,  but  not  make  statistical  inferences  with 
confidence. 


These  limitations  will  be  seen  to  become  more  important  for  nos-stationary 
radars. 


2.4,  Horn-Stationary  Radars 

As  mentioned  in  Section  2.3 ,  the  tests  discussed  so  far  indicate 
whether  a  type  of  radar  observed  at  a  specific  location  is  the  same  as  that  in 
a  previous  siting.  With  straightforward  modification  the  tests  also  can  iden¬ 
tify  those  radars  whose  signal  characteristics  may  vary  in  a  prescribed  manner 
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a«  belonging  to  tbs  sam#  type.  Depending  on  the  accuracy  of  the  weapon  system 
to  be  employed  this  information  may  be  sufficient  for  targeting,  the  specifio 
pieoe  of  equipment  at  a  location  being  unimportant  as  long  as  it  is  known  that 
a  radar  of  a  given  type  is  there.  However,  for  other  intelligence  analyses,  it 
nay  be  important  that  the  radar  that  was  there  has  left  and  a  new  one  of  the 
same  type  taken  its  place. 

The  standard  approach  to  testing  for  a  moving  radar  is  that,  whenever 
there  is  no  database  entry  close  enough  to  the  new  siting  measured  by  some  chi- 
squared  value,  database  entries  for  the  same  type  radar  (if  it  is  a  mobile 
type)  within  the  movement  radius  of  the  new  siting  are  considered  as  possible 
matches,  as  shown  in  Figure  2-1.  To  draw  a  statistical  inference  concerning 
whether  the  radar  moved  requires  statistical  tests  for  characteristics  of  that 
radar,  not  just  that  type  of  radar.  The  only  truly  statistical  inference  being 
drawn  by  the  tests  discussed  above  is  for  the  location,  and  this  test  is 
abandoned  with  moving  radars.  To  construct  the  tests  required  for  statist i— 
cally  testing  hypotheses  about  the  radar  itself  requires 

1)  a  statistical  test  of  signal  characteristics,  including 

2)  the  time  of  beam  initiation. 

The  tests  must  be  able  to  not  reject  the  hypothesis  that  the  new  looatiom  is  of 
the  same  radar  while  rejecting  the  hypothesis  that  the  radar  is  in  the  old 
location,  even  if  it  has  been  replaced  by  one  of  the  same  type. 

Such  specific  identification  by  radar  instead  of  by  type/ location  pair 
becomes  important  when  results  are  fed  into  oross-oorrelation.  Unit  deployment 
depends  heavily  on  terrain,  so  any  unit  occupying  a  given  terrain  is  likely  to 
deploy  its  radars  in  the  same  locations*  this  is  reflected  in  oross-oorrelation 
templates.  If  a  new  unit  takes  the  place  of  another,  if  its  radars  are  of  the 
same  type  and  in  approximately  the  same  location  as  those  of  the  former  unit, 
new  and  known  estimates  will  match  in  self-correlation,  attaching  the  radar  to 
the  unit  that  has  just  moved  in.  It  is  true  that  this  site  no  longer  belongs 
to  the  old  unit,  and  if  sites  were  being  kept  in  the  database  links  should  be 
broken*  but  the  radars  formerly  linked  to  the  old  unit  still  are  attached  to 
it,  at  some  new  looatiou.  If  eross-oorrelation  is  the  linking  of  equipa-snt. 


not  sites,  to  units  (sad  bsosass  salts  sots,  their  equipment  does  also),  sack 

break  lag  aad  reform  lag  of  liaka  loses  continuity  of  information  and  leads  to 

/ 

confusion. 

Although  the  eharaoter  of  signal  pares# trios  aad  their  tests  will  be 
discussed  sore  completely  later,  the  particular  role  of  observation  time  will 
be  introduced  here*  Current  incorporation  of  time  into  the  tests,  usually  in 
the  linear  discriminant,  seems  mainly  to  be  based  on  hov  long  doctrine  says  a 
given  radar  will  operate  vith  the  sane  characteristics  in  the  same  location. 
Certainly  a  significant  time  is  required  for  one  unit  to  leave  an  area  and 
another  to  redeploy  there*  and  this  time  is  quite  terrain  dependent  (which  does 
not  seem  to  be  taken  into  account  in  movement  rate  or  set-up/tear-down  times). 
Such  non-statistioal  decision  tests,  however,  can  only  give  one  a  “good  feel¬ 
ing9'  that  indeed  the  radar  has  moved  from  one  place  to  another  because  the 
right  amount  of  time  has  elapsed.  Without  statistical  hypothesis  tests  on 
observed  characteristics,  radar  movement  can  not  be  inferred  with  any  confi¬ 
dence.  If  unit  movement,  and  hence  tactics,  is  being  inferred  from  equipment 
movement,  unsupported  “good  feelings"  can  quickly  compound  into  disaster. 

The  operating  historv  of  a  radar,  expressed  as  a  time  history  of  some 
of  its  characteristics,  seems  a  likely  candidate  for  statistical  hypothesis 
testing.  This  requires  time  tests  emphasizing  very  short  rather  than  long 
windows.  Thus  time  may  be  crucial,  although  not  as  a  characteristic  in  its  own 
right  treated  independently  of  other  characteristics,  but  orucial  in  its  rela¬ 
tionship  to  the  other  characteristics  including  location* 

2.5 •  Self-Correlation  in  HBXA  -  Am  Example 

BETA  is  a  testbed  system  for  correlating  reports  from  many  different 
types  of  intelligence  systems.  It  has  self-correlation  algorithms  for  radars, 
radios,  “movers,"  "shooters,"  compounds,  and  complexes  and  has  a  cross¬ 
correlation  algorithm*  This  discussion  will  center  on  the  radar  self-oorrela- 
tion  algorithm.  BETA  correlation  is  illustrated  in  Figure  2-4* 

BETA  nses  a  simultaneous  test  that  is  a  linear  diacriminant  based  on 
five  characteristics:  location,  time,  frequency,  pulse  repetition  interval, 
and  pulse  duration*  The  figure  of  merit  for  eaoh  of  these  is  defined  so  as  to 
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Fig.  2-4:  HEI  Ptomu  Comtrol  Flow 
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li«  in  [0,11,  the  measure  of  correlation  (linear  diser iainant)  being  their 
weighted  sun.  The  weights  nay  be  adjmeted  by  the  operator  to  refine  the 
"screening  power19  of  the  test  —  that  is,  to  oontrol  the  fraetion  of  the 
oases  handled  antoaatioally  or  to  rebalaaee  the  probabilities  of  associating 
two  estiaates  that  really  refer  to  different  radars  (type  I  error)  and  of  not 
assooiating  two  that  do  (type  II  error)  —  or  to  respond  to  the  tactical 
situation*  Weights  are  noraalized,  aaking  a  convex  combination  of  characteris¬ 
tics*  if  information  is  missing  for  soae  characteristics,  the  weights  are 
renoraalized.  These  factors  and  others  affecting  the  choice  of  weights  are 
discussed  in  Appendix  D. 

The  figures  of  merit  for  the  individual  characteristics  have  three 
basic  forms. 

1)  For  location  it  is  where  x  is  the  chi-squared  quadratic 

form  from  the  multidimensional  normal  distribution. 

2)  For  time  it  is  max  (0,1-d/B),  where  d  is  the  linear  distance 
between  the  two  observation  time  intervals  and  B>0  is 
predetermined. 

3)  For  signal  characteristics  it  is  1,  if  the  absolute  difference 
between  the  closest  values  for  the  characteristic,  and  02  1* 
smaller  than  soae  predetermined  error,  max  (0,l-(o^/C2-M)/(B-M)) 
where  e|)C2»  and  B>M  are  predetermined  positive  bounds. 

For  the  tests  for  frequency  and  pulse  repetition  interval,  N  is  one.  The  loca¬ 
tion  test  is  the  standard  aapping  of  the  chi-squared  statistic  onto  the  unit 
interval,  giving  a  nonmormalized  normal  density  function.  The  tine  test  falls 
off  linearly  as  the  tiaes  of  observation  draw  farther  apart*,  this  is  in  faot 
the  distribution  function  for  a  uniform  distribution.  The  signal  characteris¬ 
tics9  measures  are  seen  to  have  the  same  form,  with  atoms  possible  at  the  ends. 
Note  that,  as  mentioned  above,  all  known  or  suspected  underlying  distributions 
are  "non-inf ormetive",  those  used  usually  to  minimize  the  (unspecified)  worst 
ease  losses  in  the  oase  of  ignorance.  Also  when  they  are  used,  the  character¬ 
istics  are  usually  assuaed  to  be  statistically  independent. 
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k  furtlur  mlytli  of  OTA  it  given  in  Appendix  0 


3*  MJJOWi  AiSMTIIS  QM  LKATSM  BfTDttRS 

When  ■ a my  of  the  assumptions  stated  ftboTft  an  relaxed,  or  do  zot 
hold,  there  It  ao  immediate  ttfttistioftl  tool  to  replace  the  oao  lost.  The 
tffoot  of  relaxing  tia  following  three  assumptions  will  bo  eoaaidortd  iA  tbit 
section. 

1)  Tht  variance  of  tbo  loefttioa  tztiaftt#  it  known. 

2)  Hit  location  tttiaftto  it  unbiased. 

3)  The  obttrrttiont  froa  which  tht  location  tttiaatt  it  derived  all 
rtf  or  to  tht  taat  emitter. 

Whenever  possible,  alternate  approaches  will  also  be  considered. 

3.1.  Known  Variance 

Knowing  the  varianoe  of  the  looation  estiaates  aade  if  possible  to  nse 
the  chi-squared  distribution  to  teat  the  equality  of  the  two  estiaates  (observ¬ 
ed  staple  aeaat).  This  variance  it  patted  to  the  self-correlation  algorithm  by 
the  observing  tensor  system.  It  would  usually  be  deterained  by  assuming  that 
the  population  variance  of  the  observations  is  known,  and  dividing  it  by  the 
number  of  observations.  This  observation  variance  is  just  the  variance  asso¬ 
ciated  with  the  observation  error.  It  nay  well  be  range  dependent,  but  even 
so,  knowing  it  for  a  set  of  ranges  for  a  system  that  can  be  extensively  field 
tested  should  be  possible. 

If  the  observation  variance,  and  thus  the  varianoe  of  the  sample 
aeans,  is  not  known  and  its  unbiased  estimate  used  instead,  the  choice  of 
statistical  test  depends  on  sample  size.  For  an  hypothesis  that  the  mean  of 
the  population,  froa  which  one  sample  was  drawn  equals  a  fixed  quantity.  Hotel¬ 
ling^  T*  statistic  is  used  for  smaller  samples.  This  statistic  has  an  F 
distribution  and  is  the  aultidiaensional  analogue  of  the  Student9s  t  statistic. 
However,  an  F  distribution  is  the  ratio  of  two  ohi-squared  distributions 


divided  by  their  niptctin  d«|r«t»  of  frMdoi,  la  tbit  otu»  the  nuerttor  is 
one  half  tbs  obi-squared  ststistio  assd  to  tost  tbs  hypothesis  that  tbs  asaa 
takss  oa  a  given  talus*  if  tbs  rariaaos  is  known*  aad  tbs  denoaiaator  is  a 
distribatioa  of  saaple  variances.  Thus*  tbs  hypothesis  tsstsd  by  a  T-squared 
statistic  is  that  tbs  estimates  ooas  fro*  tbs  saas  distribatioa*  that  both 
tbsir  asaas  aad  rariaaoss  ars  tbs  saas.  Tbs  bypotbssis  appropriats  ia  self- 
eorrslatioa  is  tbat  tbsy  rsfsr  to  tbs  saas  eaitter*  that  is*  that  tbs  asaas  ars 
tbs  saas.  Tbs  asaas  dspsad  oa  saittsr  location  aloas*  bat  tbs  rariaaos  dspsads 
oa  tbs  ssasor  systsa  and  ars  not  tbs  saas  for  different  sensor  systsas.  Tb»* 
tbs  T-squared  statistic  aay  bs  assd  only  if  there  is  only  one  ssasor  systsa  and 
asasareaent  dsrirsd  rariaaos  is  not  range  dependent. 

For  a  aulti-sensor  data  analysis  systsa  it  is  better  to  ass  tbs  chi- 
sqaarsd  ststistio  with  tbs  tro  known  and  probably  ansqaal  rariaaoss.  Tbit 
rsqairss  tbat  tbs  obssrratioas  bs  safficisatly  naasroas  for  tbs  nabiased  esti- 
■ator  of  tbs  rarianos  to  bs  a  saitabls  surrogate*  or  tbat  tbs  observation 
rariaaos  bs  known  froa  field  testing  and  be  aonitored  (probably  by  a  sample 
variance  equals  hypothesized  popalation  variance  one  dimensional  F  test  on  tbs 
angle  of  observation)  to  ensare  that  it  does  no  change  significantly  under 
coabat  conditions. 

3.2*  Unbiased  Mean 

A  fairly  obrioas  point  tbat  sboald  bs  mentioned  is  tbat  the  estimates 
aast  bs  unbiased.  Unbiased  is  assd  bare  in  tbs  saas  sense  as  in  tbe  model 
mentioned  above* 


x-p+b+s 

vbsrs  b  is  zero*  aad  x  is  tbs  saaple  asaa  froa  sons  systsa*  s  its  error.  Tbe 
bias  b  ooald  bs  nonzero  froa  two  oaaaes: 

1)  hardware  bias  in  tbs  asasareaent  and 

2)  software  bias  introdaosd  by  data  analysis  aad  estimation 
techniques. 
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Bias  in  the  statistioal  estimation  itself  can  be  avoided  by  using  known  un¬ 
biased  estimates  of  tke  mean.  Note  that  tkis  also  requires  an  unbiased  data 
integration  method  for  refining  estimates  already  in  the  database  during  self- 
correlation*  as  one  of  the  samples  being  tested  is  always  the  database  entry. 
Appendix  D  will  consider  such  data  integration  methods  for  location  estimates. 
Possible  bias  from  hardware  or  other  software*  if  it  is  well  understood  (again 
through  field  testing)*  can  be  adjusted  for  before  the  estimated  location  is 
passed  on  to  self-correlation. 

3.3.  Single  bitter  Population 

Homing  in  on  false  images  is  not  restricted  to  intelligence  data 
analysis  systems.  The  "centroid  problem"*  an  intelligent  missile  seeing  two 
emitters  and  seeking  their  oentroid*  plagues  stsnd-off  miasile  design.  When 
the  environment  is  target  dense*  the  targeteer  has  trouble  deciding  between  so 
many  often  valuable  targets,  but  the  intelligence  analyst  has  trouble  separat¬ 
ing  observations  into  samples  representing  only  one  emitter  so  that  the  tar¬ 
geteer  has  a  target  and  not  a  phantom.  The  problem  is  real'*  and  current  Soviet 
trends  indicate  the  target  density  will*  if  anything*  increase. 

Separation  can  be  based  on  looation*  or  signal  parameters*  or  both. 
Separation  of  radars  of  the  same  type  based  on  location  will  be  considered  in 
this  section  and  signal  parametries  in  the  next.  Unfortunately,  the  majority 
of  the  most  useful  information  for  separation  is  contained  in  the  observations 
and  has  been  averaged  out  by  the  time  a  location  estimate  and  EE P  is  passed  to 
self-correlation.  The  best  self-correlation  can  do  is  try  to  identify  phantoms 
so  that  "known"  database  entries  are  not  contas  inated*  for  a  phantom  by  its 
very  nature  may  correlate  with  one  of  the  true  targets  it  is  hiding*  thus 
pulling  that  estimate  farther  from  its  true  value. 

Examining  the  properties  of  phantoms*  and  the  situations  in  which  they 
are  likely  to  arise*  suggests  a  few  rul  ea-o  f-thumb  for  their  identification. 

1)  An  estimate  strongly  supported  by  at  least  two  sensor  systems  is 
probably  not  a  phantom. 

2)  Phantoms*  as  a  olass*  have  larger  varianees  than  true  estimates. 
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3)  When  a  oftndidfttft  estimate  ftftftoeifttfts  strongly  with  norft  than  one 
known  ittiantii  one  of  them  any  bs  ft  phantom. 

The  ooeervations  leading  to  these  will  be  considered  separately. 

Unless  the  mathematics  of  their  separation  algorithm  is  identical  - 
and  to  this  point  no  two  algorithms  that  are  sufficiently  the  same  have  been 
seen  -  different  sensor  systems  will  tend  to  produce  different  phantoms  in  the 
same  situation.  This  follows  because  the  phantoms  being  considered  here  are 
artifacts  of  the  mathematical  separation  algorithms  used  to  identify  observa¬ 
tions  of  the  same  emitter.  Thus,  if  a  location  is  strongly  supported  by  at 
least  two  systems,  it  is  probably  not  a  phantom  of  either.  This  also  points 
out  a  benefit  of  having  different  mathematics  in  different  systems,  in  the 
absence  of  a  phantomless  algorithm  whose  development  in  a  target-dense  environ¬ 
ment  is  unlikely. 

Phantoms  occur  where  two  or  more  emitters  of  the  same  type  are  so 
clustered  that  their  centroid  is  within  sensor-system  tolerance  of  observations 
of  each  -  that  is,  targets  are  dense  with  respect  to  measurement  sensitivity. 
Thus,  the  observations  arc  drawn  from  at  least  two  populations;  and  it  is 
likely  that  the  sample  variance  will  be  larger  than  if  all  observations  came 
from  only  one  emitter  population.  Also,  sinoe  the  controlling  parameter  is  the 
angle  of  observation  the  following  statements  can  be  made: 

1)  the  true  observations  are  more  apt  to  lie  along  the  major  than 
minor  ellipse  axis  for  standard  shaped  ellipses, 

2)  a  suspiciously  circular  EEP  is  probably  a  plantom  with  true 
locations  along  its  minor  axis*,  and 

3)  phantoms  are  more  apt  to  occur  at  greater  ranges. 

Thus,  suspicion  can  be  cast  on  an  estimate  by  only  knowing  its  EEP. 

Strong  association  between  (or  among)  two  or  more  known  estimates  may 
indicate  that  one  is  a  phantom.  In  the  missile  centroid  problem  described 
above,  the  phantom  will  lie  at  a  weighted  centroid  of  the  true  locations  it  is 
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hiding.  If  tk«r«  ar«  only  two,  it  will  lie  approximately  on  tha  lino  between 
them  (tha  deviation  from  tha  line  ariaaa  because  all  yalnes  are  aatimataa,  not 
trna  locations).  Unfortnnantaly,  for  three  or  more  true  locations,  tha  phantom 
can  lie  anywhere  within  their  convex  hull,  depending  on  how  many  observations 
there  are  of  eaoh  emitter.  However,  since  many  separation  algorithms  give  true 
emitter  estimates  if  at  least  $0%  of  the  original  set  of  observations  comes 
from  one  emitter,  and  are  most  likely  to  produce  phantoms,  if  the  observations 
come  50%  from  each  of  two  emitters,  phantoms  will  most  likely  be  around  the 
centroid  of  the  true  locations.  Simple  terrain  checking  in  such  situations  may 
indioate  that  no  radars  would  be  sited  in  that  pattern  (even  taking  into 
account  EEP). 

These  very  heuristic  rules-of-thumb  give  some  idea  of  the  mathematical 
severity  of  the  problem,  if  phantoms  are  input  to  self-correlation  algorithms. 
Once  their  information  is  integrated  with  a  known  estimate,  damage  seems  irre¬ 
parable;  and  based  solely  on  mathematics  of  location,  there  seems  no  certain 
way  to  weed  out  the  phantoms.  Some  consideration  based  on  signal  parametrics 
will  be  discussed  in  the  next  section. 


The  basic  ELINT  problem  concerns  the  location  and  identification  of 
signal  parametrics  for  non-communications  emitters  based  on  observations  made 
by  sensors.  By  non-communications  emitters  we  mean  radars  and  certain  BCM 
devices.  Subsequent  evaluation  of  these  ELINT  observations  is  dependent  upon 
statistical  assumptions  made  regarding  these  observations.  This  section  consi¬ 
ders  the  statistical  assumptions  that  are,  and  are  not,  appropriate  for  these 
ELINT  observations. 

It  is  useful  to  first  discuss  the  ELINT  problem  from  the  single  sensor 
viewpoint  with  respect  to  the  measurement  of  the  signal  parametrios  of  an 
emitter.  Emitters  to  be  sensed  may  be  monostatic1,  bistatio1,  or  multistage’ 
but,  this  analysis  of  emitter  signal  parametrics  will  be  restricted  to  the 
monostatic  ease.  The  dependencies  inherent  in  the  bistatio  and  multistatio 
emitter  oases  will  be  deferred, 

”  Monostatic  radar  -  a  single  radar 
1  Bistatio  radars  -  a  pair  of  cooperative  radars 

Multistatio  radars  -  a  collection  of  cooperative  radars 
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The  first  Approach  to  eaitter  signal  characteristics  is  aade  hers 
without  respect  to  their  aeasureaent.  Farther,  the  disoassion  will  be  aboat 
only  the  characteristics  of  relatively  ns ire  ea  it tors. 

The  typical  eaitter  will  radiate  a  foraed  beaa  of  eleotroaagne tic 
energy.  The  freqnency  of  this  eaission  nay  be  fixed  or  varying  in  soae  fashion 
and  nay  be  oontiaaoas  or  interaittent.  The  beaa  is  polarised  and  aay  be 
rotated,  fitted,  natated,  etc.  Farther  the  eaitter  aay  be  aonnted  on  a  aoving 
platfora,  thas  continaoasly  or  interaittently  changing  position,  velocity,  and 
the  aspect  of  the  beaa. 

Since  a  radar's  inaction  is  inferred  throagh  its  signal  paraaetrios, 
there  is  an  iaplicit  general  dependence  aaong  the  signal  paraaetrios  which  will 
not  be  considered  here.  Ve  will  consider  the  characteristics  and  dependencies 
of  soae  of  the  signal  paraaetrios.  Radar  eaissions  (carriers)  aay  be  con- 
tinaoas  or  interaittent.  Continaoas  eaissions  aay  be  of  fixed  or  varying 
frequencies  depending  on  their  purpose.  Interaittent  eaissions  aay  be  repeti- 
tive  or  non-repetitive. 

Repetitive  eaissions  are  characterized  by  their  pnlse  repetition 
interval  and  their  pnlse  oharaoteristics  (width,  shape,  nnifora  and  nou-unifora 
bursts).  Further,  the  carrier  frequency  aay  be  varying  during  each  pulse. 
Either  the  pulse  repetition  interval  or  pulse  width  or  both  aay  vary  over  soae 
range  of  values  in  a  nnifora  or  non-uni  fora  fashion. 

Non-repetitive  eaissions  are  generally  characterized  the  saae  as  repe¬ 
titive  eaissions  over  short  tiae  intervals.  The  non-repetitiveness  is  intro¬ 
duced  by  tiae,  carrier  frequencies,  and  geographic  diversity  aaong  a  set  of 
cooperative  radars  as  in  bistatic  and  aultistatic  systeas.  For  our  purpose,  ve 
will  consider  eaoh  of  these  cooperative  radars  as  a  separate  eaitter. 

A  reasonable  subset  of  signal  paraaetrios  required  to  locate  and 
identify  a  radar  includes  the  nature  of  the  transaission,  i*e.»  continuous  (CW) 
or  pulsed,  and  the  following  signal  oharaoteristios: 
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1)  oirrUr  frequency 


2)  pulse  width 

3)  pulse  repetition  interval 

4)  beam  scan  type 

5)  bean  scan  rate 

6)  bean  polarisation 

The  location  is  specified  along  with  the  elliptical  error  probable 
(EEP)  and  relative  bearing  of  the  semi-major  axis.  The  semi-major  and  semi- 
■inor  axes  of  the  EEP  are  both  dependent  on  the  angular  accuracy  of  the  sensor* 
hence*  are  not  independent  measures. 

The  carrier  frequency  is  an  important  parameter  in  both  CV  and  pulsed 
radars  and  nay  be  either  constant  or  nodulated,  as  in  chirp  or  linear  FM 
radars.  This  characteristic  implies  the  need  for  instantaneous  as  veil  as 
average  frequency  neasurenents.  Since  only  average  frequency  is  included  in 
the  Tactioal  Electronic  Intelligence  (TACELINT)  message,  the  frequency  charac¬ 
terisation  is  incomplete. 

There  is  an  intrinsic  dependence  between  pulse  width  and  pulse  repeti¬ 
tion  interval  through  the  peah-to-average  power  ratio  required  for  a  specific 
radar  performance  factor.  Even  for  independently  varying  pulse  widths  and 
intervals,  this  lack  of  independence,  on  the  average,  remains. 

Pulse  width  measurements  must  take  pulse  multiplicity,  width  agility, 
and  shape  into  consideration.  However,  only  a  single  measure  of  pulse  width  is 
included  in  the  TACELINT  message. 

Pulse  repetition  interval  must  take  into  account  interval  staggering 
with  multiple  stagger  legs.  But  the  TACELINT  message  includes  only  pulse 
repetition  interval  and  whether  it  is  fixed,  staggered,  or  jittered. 
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formed  latiftfli  b«u  is  described  by  to«&  typt,  sets  rats,  sad 
polarisation*  Useful  measures  not  inolndad  in  the  TACELINT  passage  aro  the 
horizontal  and  vertical  bean  width  and  bean  multiplicity. 

There  are  two  conclusions  to  be  drawn  fron  this  brief  diseussion  of 
radar  signal  paranetrios: 

1)  these  signal  paranetrios  are  not  all  independent  of  one  another, 
and  oertainly  not  of  their  intended  function, 

2)  the  set  of  signal  paranetrios  included  in  the  TACELINT  message  is 
insufficient  to  "fingerprint"  a  specific  radar  emitter  as  dis¬ 
tinct  from  another  of  the  same  model. 

The  sensors  measure  the  location  and  signal  paranetrios  which  lead  to  the 
production  of  TACELINT  messages.  There  are  four  distinct  cases  to  consider: 

1)  single  sensor,  single  emitter, 

2)  single  sensor,  multiple  emitters, 

3)  multiple  sensors,  single  emitter, 

4)  multiple  sensors,  multiple  emitters. 

Since  a  single  sensor  can  only  provide  a  1 in e-of-b earing  to  an  emitter 
and  a  set  of  measures  of  the  emitters  signal  paranetrios,  the  sensor  must  be 
mounted  on  a  mowing  platform  to  obtain  a  "fix"  on  the  emitter  using  multiple 
lines-of-bear ing  from  different  locations  at  different  times.  The  variances 
associated  with  each  senaor  of  an  emitter  are  the  same  because  these  variances 
derive  from  the  sensor  characteristics  only.  With  multiple  emitters,  the 
location  and  signal  paranetrios  measures  may  differ,  but  the  variances  asso¬ 
ciated  with  them  remain  the  same. 

The  situation  with  multiple  senaor  systems  is  quite  different:  the 
"fixes"  on  a  emitter  are  usually  made  based  on  the  same  emission  in  a  form  of 
time  coincidence.  These  systems  provide  more  refined  measures  of  emitter 
location  baaed  on  a  Lor  an- like  time  of  arrival  method.  However,  the  sensors 


aa  y  be  of  the  etae  or  different  aodel  types,  so  the  variances  assoc  is  ted  vith 
each  sensing  of  the  eaitter  aay  veil  be  different.  For  anltiple  eaitters  the 
location  and  signal  paraaetrios  aeasures  aay  differ  and  so  vill  the  variances 
associated  vith  then. 

The  conclusions  to  be  dravn  froa  this  brief  discussion  of  FLINT  sen¬ 
sors  are  that: 

1)  The  differences  in  sensor-dependent  variances  are  included  in  the 
EEP  associated  vith  each  location  estiaate  sent  in  the  TACELINT 
aessages.  So,  location  data  is  suitable  for  subsequent  statisti¬ 
cal  manipulation. 

2)  The  differences  in  sensor-dependent  variances  for  the  signal 
paraaetrios,  and  indeed  the  variances,  are  not  conveyed  in  the 
TACELINT  aessages.  This  iaplies  that  the  signal  paraaetrios  are 
not  aaenable  to  subsequent  statistical  aanipulation* 

5*  OBSERVATIONS  AMD  CONCLUSIONS 

Throughout  this  discussion  ve  have  talked  interchangeably  and  indis- 
criainantly  about  tvo  kinds  of  statistics:  those  for  vhich  the  distribution  is 
knovn  and  those  for  vhich  it  is  not*  On  the  foraer  is  based  classical  statis¬ 
tical  inference  and  the  capability  to  test  hypotheses  and  aake  stateaents  vith 
soae  confidence.  These  are  the  basis  for  scientific  ezperiaental  evidence. 
The  latter  are  descriptive  or  "indicative”:  they  aay  indicate  soaething  about 
the  state  of  the  object  being  studied,  but  can  not  rigorously  support  any  such 
stateaent.  They  are  the  basis  of  auch  aanageaent  and  financial  decision  ask¬ 
ing.  Most  statistics  used  in  these  algorithas  are  indicative%  yet,  as  in  the 
oase  of  aobile  units,  fairly  sophisticated  inferences  are  being  dravn.  Just  as 
it  takes  a  skilled  and  experienced  Manager  to  aake  good  financial  decisions,  it 
takes  a  very  skilled  and  knovledgeable  intelligence  analyst  to  drav  the  aost 
froa  these  analysis  tools.  He  practices  not  science  but  an  art  fora  whose 
success  depends  on  his  individual  talents.  Autoaated  systeas  that  give  hia 
aore  and  aore  rav  data  to  handle,  and  only  ainiaal  help  in  handing  it,  are 
doing  hia  a  disservice.  For  these  systeas  to  carry  vith  thea  the  connotation 


of  scientific  statistics,  firing  an  unwarranted  confidence  in  their  results  and 
obscuring  the  crucial  role  played  by  the  analyst  does  hist  an  injustice. 

To  the  extent  tests  can  be  put  on  a  stronger  statistical  footing,  the 
quality  and  credibility  of  the  information  derived  from  these  systems,  and  thus 
the  real  support  they  provide  the  intelligence  anslyst.  will  increase.  Several 
initial  steps  are  possible  and  reconaended  to  aahe  statistically-based  informa¬ 
tion  aore  reliable. 

1)  Ensure  the  chi-squared  location  statistic  is  good  by  providing 
range-dependent  population  variances  (perhaps  obtained  initially 
froa  field  testing)  from  sensor  systems,  and  have  these  varian¬ 
ces  monitored  within  the  sensor  system  to  verify  their  continued 
validity. 

2)  Develop  distr ibut ions  for  some  signal  parametric  statistics, 
perhaps  using  a  time  history.  This  need  not  replace  the  linear 
discriminate,  but  could  provide  a  distribution  for  it. 

3)  Pay  closer  attention  to  distributions,  especially  when  refining 
(or  integrating)  signal  parametrics  information.  This  may 
involve  and  developing  speoific  prior/posterior  distributions. 

Implementing  some  of  these  suggestions  requires  additional  information  be 
carried  in  the  TACELINT  message. 

Finally,  it  is  not  yet  clear  which  or  what  combination  of  the  mathe¬ 
matical  approaches  -  classical  or  Bayesian  statistics  or  non-statistical  -  will 
best  serve  the  need  of  the  intelligence  analyst.  It  is  dear  that  all  three 
approaches  should  be  pursued,  in  parallel,  with  special  attention  to  the  mini¬ 
mum  information  which  will  be  required  by  oross-correlation  algorithms.  As 
stated  at  the  outset,  intelligence  correlation  is  an  hierarohioal  process*, 
each  part  must  not  only  be  as  sound  as  possible  within  itself,  but  also  proper¬ 
ly  fulfill  the  information  processing  requirements  of  its  role  in  the  overall 
prooess. 
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A.  AMNOXAXB)  BWBMC8  LIST 

lt(«teaofi  will  be  listed  ia  the  categories:  mathematics,  radars,  sad 
military  systems.  Bach  will  be  followed  by  a  brief  iadicatioa  of  its  formal 
lews!  (if  appropriate)  aad  applicable  areas  of  this  report.  Most  works  listed 
iaclade  good  refereace  lists. 

Mathematics 

Box,  G.  E.  P,  aad  Tiao,  G.  C.  Bayes isa  Iafereace  ia  Statistical 
Aaalysis.  Reading,  Massachusetts:  Addisoa-Vesley,  1973. 


A  mathematically  thorough  applioationw-motivated  senior/ gra¬ 
duate  text  on  Bayesian  inference,  accessible  to  those  in  other 
technical  fields. 


Chakravarti,  I.  M.,  Laha,  R.  G.,  Roy,  J.  Handbook  of  Methods  of 

Applied  Statistics.  Volume  I.  John  Wiley  and  Sons,  New  York: 
1967. 

A  well  organized  guide  to  descriptive  aad  inferential  statisti¬ 
cal  techniques,  with  clearly  stated  assumptions  and  examples*,  a 
good  section  on  multivariate  analysis. 

Deutsch,  R.  Estimations  Theory.  Englewood  Cliffs,  New  Jersey: 
Prentice-Hall,  1965. 

Standard  book  on  location  estimation,  confidence  ellipses,  and 
mathematical  estimation  arising  especially  In  radar  problems. 


Hoel,  P.  G.  IteSpflwUffP  iS.  Mathematical  Statistics.  New  York:  John 
Wiley  and  Sons,  1971. 


Provides  necessary  background  for  more  adv snood  books. 


to  Probability 


Hoel,  P 


Johnson 


Kendall 


Lass,  E. 


Scheffe' 


.  G.,  Port,  S •  snd  Stone,  C.  J.  Introdnotlon 

Theory.  Boston:  Houghton  Mifflin,  1971, 

_ •  Iatrodpotton  la  Statistical  Theory.  Boston. 

Houghton  Mifflin.  1971. 

These  two  volumes  provide  a  thorough  contemporary  mathemAticAl 
introduction  to  probAbility  and  statistics. 

H.  U,  and  Leone,  F.  C.  Statistical  and  Experimental  Design. 
New  York:  John  Tiley  and  Sons,  1977. 

Another  strong  section  on  multivariate  analysis,  with  examples* 
assumptions  not  as  explicitly  stated,  more  applications 
oriented. 

M.  G.  The  Advanced  Theory  of  Statistics.  Vols.  I  and  II. 
London:  Charles  Griffin  and  Company  Limited,  194$. 


Classical  treatment  of  statistics,  multivariate  analysis  pre¬ 
sented  in  seoond  volume,  more  analysis-oriented  development. 

,  and  Gottleib,  P.  Probability  and  Statistics.  Reading, 
Massachusetts:  Addison-Vesley,  1971. 


A  unified  introduction  to  probability  and  statistics  with  some 
foous  on  engineering  requirements. 

§  H.  2&R  Analysis  Variance.  New  York:  John  Viley  and 
Sons,  1959. 


Barton,  David  L  Radar  System  Ana  In  it.  Englewood  Cliffs,  New  Jersey: 
Prentice-Ball,  1965. 

This  graduate-level  text  treats  measurement  errors  especially 
well*  Mnltistatic  radar  systems  and  their  characteristics  are 
also  covered* 


Barton,  David  K«,  and  Ward,  Harold*  Handbook  of  E&4££  M0»««e*ent. 
Engle wood  Cliffs,  New  Jersey:  Prentice-Hell,  1969. 

This  handbook  emphasises  radar  measurement  errors  and  discusses 
those  due  to  digital  signal  processing  of  radar  signals. 

Cook,  Charles  E.,  and  Bernfeld,  Marvin.  Radar  Signals.  New  York: 
Academic  Press,  Inc.,  1967. 


The  most  comprehensive  source  for  radar  signal  design. 

Oppenheim,  Alan  V.,  ed.  Applications  of  Digital  Signal  Processing. 
Englewood  Cliffs,  New  Jersey:  Prentice-Hall,  1978. 

This  relatively  recent  compendium  contains  an  excellent  chapter 
on  applications  of  digital  signal  processing  to  radar  from  a 
non-hardware  viewpoint. 

Rabiner,  Lawrence  R.,  and  Gold,  Bernard.  Theory  and  Application  of 
Digital  Signal  Processing.  New  Tork:  McGraw-Hill,  1975* 


This  engineering  level  text  devotes  an  entire  chapter  to 
digital  signal  processing  applications  to  radar  with  a  strong 

hardware  emphasis. 


Skolnik,  Merrill  I.  Introduction  to  Radar  STStoas.  New  Tork:  MoGraw- 
■111,  1962. 


An  excellent  general  reference  oa  radar  systeas  although  dated. 


W 1  *£J_  Syil  ea» 

Beta  Correlation  Ceater  Application*  Coapater  Proxraa  Coaflaoratloa 
Itea  Dewelopaent  Specification  (No.  SS42-43B  Part  I).  Los 
Angeles:  TRW.  1980. 


Beta  Correlation  Ceater  Applications  Coapater  Proaraa  Configuration 
Itea  Dewelopaent  Specification.  Volaae  I  and  Appendix  II 
[Correlation  Processing  CPC]  (No.  SS22-43  Part  II).  Los 
Angeles:  TRW,  1981. 

Eastace,  Lake,  and  Hartaaa,  eds.  The  International  Coanteraeasares 

Handbook.  7th  Edition  [1981-1982].  Palo  Alto,  California:  EW 
Coaaaaieatlons,  1982. 

laky,  0.  C.  Weapons  and  Tactics  of  ths  Sowlet  Arar.  London:  Jane's, 
1981. 

Technical  Directive  005  for  TCAC(D)ASAS-SEWS  ADM  Message  Processing 
Tactical  ELINT  Message  (No.  CAC-TDM-00S).  Burlington. 
Maaaachaaetta:  RCA  Corporation,  1981. 

Intelligence  Message  Foraatting  and  Procedures,  User  Handbook,  Amy 
Test  Unit,  DRSEL-SBI-ATU,  Fort  Monaouth,  New  Jersey, 

21  Septaaber  1981. 
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package  EDR_Package  is 

type  Taceilntnsq  is  private; 

type  EDR  is  private; 

type  Radar^T  is  C type_a/ type_b> type— c ) ; 

type  REAL  is  digits  7; 

function  GET_EDR  ( Msg  :  Tace  1  In  tflsg  )  return  EDR; 
function  RADAR_TYPE  < Rec : EDR )  return  Radar_T; 
function  FETCH  (First : Boo  lean;  Radar:  Radar_T) 

return  EDR; 

function  BOX  (  0 1  dRec  >  NemRec  :  EDR;  Rad  ar  :  Radar_T  ) 

return  Boolean; 
procedure  CORRELATE  ( 0  1  dRec  .  NeuiRec  :  in  EDR; 

Radar  :  in  Raaar_T; 

tioc  :  out  Real  ) ; 

procedure  STORE  (NeuiRec:  m  EDR;  Radar  :  m  Raclar_T; 

OK  :  out  Boo  le  an  )  ; 

procedure  REPLACE  (  0  l  dRec  >  NeuiRec  :  in  EDR;  ok:  out  Boolean  ' 

function  INTEGRATE  (  0  1  dree  >  Nemr  ec  :  in  EDR  ;  return  lvr; 

pr i vate 

— full  type  declarations  for  TacellntMsg  and  EDR 
end  £DR_Package; 

--Inside  the  package  oody  EDR_P  ac  k  age  tne 
— procedures  STORE  *  REPLACE .  and  FETCH  mil*  &e 
--implemented  Oy  calling  appreciate  entries  of  a 

—  "monitor"  task  (njhicn  serves  as  a  synchr  on  i  rat  i  on 

—  agent  for  accesssmg  the  common  data  dase). 

ujith  EDR_Package  ;  use  EDR_Package; 
procedure  MAIN  is 

Some_Cond it  ion  : Boolean; 

0 1 dRec  > NemRec  : EDR; 

Procedure  Keep_Best_EDR  is  separate; 

deg  i  n 

NemRec  :=  GET_EDR  (Msg); 

Radar  iz  RADAR_TYPE  (NemRec-: 
mhiie  Some_Cond i t i on  loop 

0 1 dRec  :=  FETCH  (First.  Radar); 
if  BOX  (OidRec*  N emRec<  Radar)  then 
Keep_Best_EDR; 
end  if; 
end  loop; 
if  noc  >  Max  then 

STORE  (NemRec.  Radar.  OK); 
elsif  Mac  <  Mm  then 

REPLACE  (OidRec.  INTEGRATE ( 0 l dPec . NernPec > . OK  ; 

e  i  se 

go_to_operator; 

end  if; 


end  mam; 
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MODULE  aCSPET  <  INPUT.  OUTPUT  >  ; 

PROCEDURE  ACSRET; 

CNAME:  AC3PET-C COMPARISON  PROCESS  CONTROL)}* 

CPUPP05E :  COMPARISON  DETERMINES  THE  SIMILARITY  BETWEEN  SUBJECT  AMI) 

..'Candidate  edrs.  subject  and  candidate  edps  arpiuk  at 

<  CCMPAP I SON  FOLLOWING  DATA  BASE  RETRIEVAL  UIA  PROCESS  CONTROL. 

<A  MEASURE  OF  CORRELATION  (MOO  DEFINING  THE  LIKELIHOOD  THAT  THE 
{SUBJECT  EDR  AND  THE  CANDIDATE  EDR  REFER  TO  THE  SAME  ENTITY  is 
COMPUTED.  THIS  MEASURE  IS  COMPUTED  AS  A  LINEAR  COMBINATION  OF  THE 
C RESULTS  OF  FIELD  COMPARISON  ^ESTS  FOP  CERTAIN  CORRESPOND  I NO  FIELDS 
OOF  THE 


JUC.  C.'_  i 


and  Candidate  edps 


C INCLUDE 


:  :e :<p  .  con1* ; 

: : -  dr.  :  /m  l 
:mppr  .  com'*: 
I  I N  7  E  j  '£  P  J :  :  /  »  P  C  F  L  r«G  *  L 
CD  I MENS  I  . n  ^S~JAM  i" 


.INCLUDE 


pp  ; 


:  i  cr  c  •*»'>•**-'* 


a  £  !.i0 

33Q0  ipepfopm  f-:m  'rE2r  for  location; 

3  -i  0  3 

3500  hCSPL*: 

3600  FOfwrj  :  z  ITEPPC  J  1  ; 

S-00  IF  :  IUCkC  I  ]  =  3  »  r'tND  •  PCFLAG  =  0 

3  =  00  *riEN  3E3IN 

3  300 

3  0  0  v3  >*  p  r  ©  f  o  R  m  r  0  N  TEST  F  0  R  T  'f  :i£  j 


3300 

AC3RFT ; 

=  330 
5400 

o  r  z  f  o  ^ 

F0?1  TE3t  FOR  RADaR  FREQUENCY} 

3f  ._»0 

9603 

ACSPFF ; 

3"  30 

?  r  0  0 

.*  p  r  3  c*  j.  •=  f  i 

F0!1  TEST  FOR  PM  l  32/ 

a?  00 

1  0830 

hCSRF I ; 

;  o :  oo 

10133 

\PEPFOPM 

F0I1  TEST  FOR  PULSE  DURATION} 

13303 

13400 

ACSRFD ; 

10500 

13600 

calculate  moo 

1  0  T  *3  0 
10303 

FOP  r  : s  1  TO  5  DO 

13900 

3E3IN 

11000 

IF  CIUCKlI]  r  0)  AND  CITEFFLII  ■:  [L 

11103 

THEN  BEGIN 

:  1133 

<RWT  :=  floatc iweighc ixsltj, : 

1  1  303 

HOC  :=  NOC  *  RUT  *  F  OMCMF  ’  ’ 

:  1430 

SUN  : z  SUN  4  RWT 

11530 

end; 

1  1600 

end; 

1  1~00 

11330 

IF  (SUN  <>  0)  THEN  NOC  ir  NCC^SUN 

1  1900 
12003 

END 

12100 

5  NO  ELSE 

>  IYCk  »  RCFLAGO 

12200 

END 

12200 

ELSE  I 

vrsp  ; =  * 

12  430 

END:  l  OF 

PROCEDURE  ACSRi;  T  } 

12530 

12603 

3‘C  .  :of 

M,.‘  D’JLE  A C S P E T > 

inft,  VJLLZ 
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2^00 
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4130 
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.1 0  D  U  L  £  AC5RL' 


PROCEDURE  ACSRLT; 

{ NAME  '•  ACSRLT  -  FOM  TEST  FOR  LOCATION} 

{PURPOSE :  THIS  FIGURE  OF  MERIT  TEST  MEASURES  THE  LIKELIHOOD  THAT  THE> 

{subject  s  candidate  edrs  refer  to  the  same  entity  on  the  battlefield. > 

{THIS  FIGURE  OF  MERIT  IS  COMPUTED  AS  PROBABILITY  GIUEN  THEIR  LOCATION} 
{S  LOCATION  ERRORS  THAT  THE  SUBJECT  3.  CANDIDATE  EDRS  ARE  COLOCATED} 

{{PARAMETER  EDFLT  =  20/000,000,  NDFLT  z  10,000,  000  ?■ 

{PARAMETER  MAXDIS  =  18.-12  !  EXP  v  -  .  5*D  1ST  )  <  .  0001} 

{INCLUDE  ” ACCEDR . COM" } 

{COMMON  'CMPLOC.'FOMCMP(S)  /  IUCk'5)  -  ITERR(5)> 


CONST 

EDFLT  =  20000000: 
NDFLT  =  1 0030000; 

MAXDIS  =  13.42: 

-JAR 

{LOCAL  VARIABLES} 


DIFFE 
D  I  rrN 
riDD  1  1 
^  L  D  1 2 
4DD22 


real; 

®eal; 

REAL : 

seal; 
-Eal  ; 
seal; 

peal; 


{GLOBAL  VARIABLES} 
3L0C  :  ARRAY  Ci. 
CL OC  :  ARRAY  Z 1 . 
CSIGMA  :  ARRAY  Cl. 
SSI GMA  t  hRPAY  C 1 . 
IUCK  :  APR AY  C 
ITERR  :  APR AY  C 
FOMCMP  :  ARRAY  Cl 

BEGIN 


ARRAY  C  1 .  . 51  OF  REAL  J 
ARRAY  C 1 . .53  OF  REAL; 

ARRAY  Cl. .S3  OF  REAL ; 
hRPAY  Cl.  .S3  OF  REAL  : 

:  array  ci.. 51  of  integer; 
:  APRAY  Cl.. 51  OF  integer; 
:  ARRAY  C  1  .  . 51  OF  real; 


{CHECK  FIELDS  FOR  UAL ID  DATA} 


4300 

4300 

5000 

5100 

5200 

IF  f'ABS  'SLGCCll)  >  EDFLT) 

< 486' CLDC: 1 2 1  >  EDFLT) 

(  ABS  '  SL0CC  3  3  )  >  NDFl.  X  ) 

( ABS C CLOCC 23 )  >  NDFLT)) 

THEN  I'JCK  Cl]  :  =  1 

OR 

OP 

OR 

5303 

ELSE  IF  SSI GMA C 1]  <  0  ) 

OP 

'  SSI 

3MAC  SI 

5400 

5500 

5600 

'  CSIGMAC 1 J  <  0) 

’HEN  IUCKC  13  :  :  1  ; 

OP  - 

CS I G 

MftCSl 

5700 

5900 

5300 

6000 

{COMPUTE  DETERM INN AN ^ 

if  • :uck:  ::  =  a 

T^E N  BE  J I N 

DET 

IF 

J  A  r  A 

6130 

:  =  fljatj-c loc 

\  * 

—  I  _ 

..... 

6200 

{ d i rrN  : z  f_:a~;  gl:c 

.  ^ 

—  I L 

:  c  {  l  : 

6  300 

HDDll  -  -  SSI  3M4C  l .!  * 

-  3  I  J- 

6400 

65?G 

-z'.’.z.  -■  z  ss ig.'-’pc r •  : 

A  ^  * 

— 

6730 

6300 

6900 

7000 

7100 

7200 

7300 

7400 

7500 

7600 

7700 

7900 

7900 

3000 

9100 

9290 

33*0 

3409 

3500 

3600 


CCnt-CK  FU*  ‘-HLlO  CONUHR ImMOE  nHrRIC23} 

IF  URDDll  >z  0)  AND  tDET  >  0O 
THEN  BEGIN 

■C  COMPUTE  F OM  FOR  LOCATIONO 

DIST  (ADD22  *  <DIFFE**2)  -  2.0  *  DIPTN  *  DIFFN  *  A  1)1)  12 
+  ADD  1 1  *  CDIFFN  **2 ) )  ✓  D£T; 

IF  C  DIST  < =  MAXDXS  > 

THEN  F  OfiCMP  C  i  3  :  =  EXPC-O.S  *  DIST) 

END 

ELSE  ITERRC13  :=  i; 

END 

•C  NO  ELSE  AS  FOMCMP  PRESET  TO  ZERO 
END:  OOF  PROCEDURE  AC3RL 
END.  OOF  MODULE  HC3Rl:> 


•  INPUT ,  on rrut . ; 
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’-'yijL£  «  C  3  R  F  T 
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The  following  three  pages  on  the  BETA  system  were  provided  by  Veotor 
Research  Inc.  Section  D-l,»  MA  Location  Estimating  Algorithm/*  presents 
maximum  likelihood  estimating  methods  for  radar  location  supported  by  more  than 
one  sensor  system.  Section  D-2,  "Algorithm  Attributes/*  considers  the  BETA 
system  from  a  Bayesian  perspective.  Section  D-3»  "BETA  Self-Correlationnn: 
Evaluation  Methodology/*  discusses  some  of  the  more  philosophical  aspects  of 
algorithm  anaysis. 

These  analyses  are  referred  to  within  the  related  sections  of  the  main 
report,  and  some  of  the  more  significant  conclusions  incorporated  in  the  final 
observations  and  conclusions  (Section  5). 
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1.0  INTRODUCTION 

One  of  the  principal  functions  performed  by  BETA  is  the  development 
of  estimates  of  target  locations.  This  chapter  examines  one  algorithm 
that  updates  estimates  of  target  locations  as  successive  reports  are 
received  and  processed,  the  discussion  that  follows  assunes  that  the 
reports  are  correctly  associated,  that  Is,  that  each  provides  an  estimate 
of  the  same  location. 
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2.0  ASSUMPTIONS 


Errors  In  estimating  location  arise  primarily  from  sensor  perform¬ 
ance.  As  such,  sources  are  both  random  and  systematic,  examples  of  the 
latter  being  Improper  calibration  or  Incorrect  locations  of  sensor 
components.  For  the  discussion  which  follows  we  assume  that  systematic 
errors  are  zero  or  equivalently  that  calibration  has  removed  bias.  We 
further  assume  that  target  location  estimates  are  distributed  according 

to  a  bivariate  normal  distribution  with  the  true  location  given  by  the 

(ui\ 

mean  *  l  u  2  / 

Reported  locations  are  denoted  by  X ^  */XjA 

\  *21/ 


with  covariance  matrix  * 


oil' 


P  1alia21 


2 

P 1°11a21  a2i 

whereof  and  are  the  variances  of  the  random  variables 
X^-j  and  X2i ,  respectively  and  p ^  is  the  correlation  coefficient 
associated  with  their  covariance.  Thus,  the  density  function  of  a 
reported  location  x  Is: 

1 

exp  (  -  - 

2 


exp  (-  _i_(x-!i)TA-1(x-u)J 


2  7T  JA(  1/2 

Reported  locations  are  assuned  to  be  Independent,  with  different 
covariance  matrixes  corresponding  to  different  sensors  and/or  single 
sensors  In  different  locations  relative  to  the  target.  We  assune  that  If 
a  report  Is  based  on  several  lines  of  bearing,  or  a  series  of  fixes,  then 
the  appropriate  reduction  In  variance  Is  reflected  In  the  covariance 
matrix  used  In  updating  the  location  estimate,  l.e.,  the  covariance 
matrix  Is  always  known. 
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3.0  MAXIMUM  LIKttlHOOO  ESTIMATE 


As  an  estimate  of  target  location,  the  maximum  likelihood  estimate 
Is  derived.!  Consider  a  collection  of  reported  target  locations  xi  ; 
1>1»1,2,...,N  with  covariance  matrixes  ;  1»1,2,...,N.  Then  the 
likelihood  the  likelihood  function  Is  proportional  to: 

exp  ^ -1/2  £ 

Taking  partial  derivatives  with  respect  to  j  and  2  and  equating  the 
results  to  zero  one  obtains  for  the  estimates  uj  and  U2’ 

(s*'j*  ■  (,f,  ’■•'») 

or  *  ■  1  r  1  •  r  1  i  • 

Vw  /  1,1  V  / 

The  expected  value  of  u  Is  the  true  location  p  and  the  covariance 


matrix 


Is  /  N  V1 

(5  n 


Mote  that  In  terms  of  Implementing  this  algorithm  for  estimating  jj  one 
must  store  two  quantities  namely: 


Ck 

1*1 


iFor  a  more  general  discussion  see:  Anderson,  T.W. ,  An  Introduction  to 
Multivariate  Statistics,  John  Wiley  and  Sons,  Inc.,  Mew  York,  1958. 
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4.0  ORIENTATION 


One  of  the  features  of  current  ELINT  sensor  systems  Is  that  errors 
In  range  generally  are  larger  than  those  In  azimuth.  Further,  because  of 
different  sensor  system  locations  error  ellipses  will  have  different 
orientations.  Consider  a  standard  Cartesian  coordinate  system  with  u  * 
o.  The  the  error  ellipse  Is  described  by  jJa"*x_,  where 

1  -p 

(l-p2)ai2  (l-p^)aia2 

-P  I 

(I_p2)aia2  (1-P2)°22 

-T.-lw  Xl2  _  2X1X2_  *  X22 

iA  -  (l-pZjo^  (1-02)0^2  (1_p2)a22> 


Consider  the  rotation 


where 


yi  »  Xlcos e -  x2s1ne 
y 2  ■  x^slne*  x2cose 


Applying 


this  rotation  we  obtain  a  new  random  vector 


with  covariance  matrix: 


cose  s1ne\/  a\  po  \  /  cose  -  sine 


y$1ne  cos  y\pa  102  °2  J  y  *1n®  cos ey 

which  has  zero  correlation,  or  equivalently  9  Is  the  orientation  of  the 


error  ellipse  with  the  baseline  of  the  coordinate  system. 


5.0  OBSERVATIONS 


To  Illustrate  the  Implications  of  the  target  location  algorithm 
consider  a  simplified  example  In  which  all  reported  locations  have 
covariance  matrix 

/  2 

>12  0 

)  c>i‘ 

In  this  case  after  N  reports,  the  covariance  matrix  of  the  estimate 
Is: 

Jl2  0 

7" 

o 

N 


‘l2 


If  artillery  accuracy  requirements  are  expressed  In  terms  of  a  100  meter 
CEP  we  require  an  equivalent  variance  of: 


o 


R 


100 

(21J12)1/2 


*  84.93 


Since  ai  Is  constant  across  all  reports,  the  nunber  of  reports 
required  to  achieve  targeting  accuracy  is  given  by: 

This  function  Is  presented  In  exhibit  5-1. 


Number 

Reports 
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The  Implications  are  that  achieving  adequate  accuracy  will  require  a 
significant  number  of  reports  unless  sensor  performance  Improves 
substantially.  For  example,  consider  a  sensor  which  has  a  covariance 
matrix:  /  \ 


(200)2 
0 


0 

(500)2 


representing  a  system  with  range  errors  greater  than  azimuth  errors. 

Using  an  approximation  to  obtain  an  equivalent  CEP  it  can  be  shown  that 
16  reports  would  be  required  to  achieve  an  equivalent  CEP  of  100  meters. 
In  terms  of  current  ELINT  systems,  and  artillery  target  location  accuracy 
requirements,  the  implication  is  that  until  precision  location  systems 
are  fielded  the  number  of  targets  developed  and  nominated  for  artillery 
missions  will  be  limited.  This  may  not  preclude  use  of  air  assets;  both 
fixed  wing  and  helicopters.  Note,  moreover,  that  the  Introduction  of 
precision  location  systems  contributes  to  the  efficiency  of  self- 
correlation,  but  If  errors  are  sufficiently  small  ,  self  correlation  is 
not  as  necessary  for  target  nomination.  This  raises  the  Issue  of 
emphasis  In  the  system:  is  It  target  development,  situation  development, 
or  both?  If  target  development  Is  emphasized.  It  Is  difficult  with  the 
current  sensor  suite.  On  the  other  hand,  if  precision  location  systems 
are  Introduced,  location  estimation  Is  unnecessary  and  context  may  be 
more  Important. 
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1.0  INTRODUCTION 

In  order  to  provide  an  approach  to  evaluating  and  classifying 
self- correlation  algorithms,  this  paper  examines  such  algorithms  in  the 
context  of  Bayesian  decision  theory. *  In  particular,  comparisons  are 
made  of  example  Bayesian  decision  criteria  with  one  approach  used  In 
BETA,  namely: 

"correlate  1f£  a^>h.“ 

In  certain  Instances  reports  received  by  BETA  do  not  contain  values  for 
one  or  more  of  the  data  elements.  In  such  cases  it  is  our  understanding 
that  the  procedure  Implemented  Involves  renormalizing  the  weights  used  In 
the  statistic,  l.e..  If  figure  of  merit  xj  cannot  be  calculated,  the 
weights  used  are: 

a1 

The  Implications  of  this  procedure  are  also  examined. 

The  purpose  of  algorithms  such  as  those  employed  In  BETA  Is  to  make 
decisions  based  on  some  pre-established  criteria.  Any  particular 
decision  Is  made  by  considering  data  reported  by  collection  systems.  As 
such,  the  data  Is  subject  to  a  series  of  errors  which  cannot  be  predicted 
with  certainty  In  advance.  For  example,  suppose  an  emitter  Is  operating 
at  a  frequency  f.  The  frequency  fr  reported  by  a  collection  system 
will  not  In  general  equal  f,  but 

f  r  »  f  +  A  f$ 

Ipor  a  more  general  discussion  see:  DeGroot,  Morris  H.,  Optimal 
Statistical  Decisions,  McGraw-Hill,  Inc.,  Mew  York,  1970. 


i 


V 
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where  the  difference  Af$  Is  a  function  of  collection  system 
performance,  environment,  battlefield  geometry,  etc.  The  difference 
cannot  be  predicted  In  advance,  but  can  be  described  probabilistically, 
l.e.,  the  reported  frequency  fr  Is  a  random  variable  with  an 
appropriate  distribution  function.  Similarly,  the  absolute  value  fo  the 
difference  between  two  reported  frequencies  Is  a  random  variable, 
although  It  need  not  be  a  figure  of  merit  In  the  BETA  sense,  l.e.,  take 
values  only  between  zero  and  one.  However,  It  Is  worth  noting  that  If 
Is  a  random  variable  with  distribution  function  F(x): 

Pr[X  <  x]  »  F(x), 

then  the  random  variable  Y  defined  by 

Y  -  F(X) 

takes  values  only  between  zero  and  one  and  moreover  has  a  uniform 
distribution,  l.e., 

*  y  0  <y  <  1 

Pr[Y  <  y]  «  0  y  <  0 
-  1  y  >  1. 

The  location  difference  figure  of  merit  used  In  BETA,  exp(-d/2),  has  this 
distribution  If  the  two  reported  locations  are  In  fact  the  same. 

In  general,  algorithms  such  as  those  In  BETA  asune  that  ground  truth 
falls  Into  a  set  of  mutually  exclusive  categories  or  states  of  nature 
(denoted  In  this  paper  by  ,  only  one  of  which  Is  true).  The 
algorithms  provide  a  means  of  deciding  which  particular  state  of  nature 
Is  true  using  observed  or  collected  data.  The  assumption  underlying  most 
such  algorithms  Is  that  the  distribution  functions  describing  the  random 
nature  of  the  data  depend  upon  the  states  of  nature.  Differences  among 
the  distributions  are  then  used  to  construct  appropriate  decision 


criteria  and/or  analyze  algorithm  performance.  In  the  remainder  of  this 
paper  a  Bayesian  structure  Is  used  as  a  framework  to  analyze  the  linear 
combination  of  figures  of  merit  used  In  BETA  sel f-correlatlon  algorithms. 
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2.0  M00EL 


Consider  a  decision  problem  Involving  only  two  states  of  nature  a  * 
{  *1»*2  }  and  &1®  decisions  D  *  [  »dg }  •  Following  the  observation 
of  a  randan  vector  x,  a  decision  Is  made  with  loss  matrix: 


i.e.,  If  the  decision  dj  is  taken  when  wg  is  true,  a  loss  of  £21 
Is  incurred.  Prior  to  the  observation  of  x,  the  decision  maker  believes: 

Pr[W  *  wj]  *  p,  and 
Pr[W  *  w23  *  1-p, 

with  the  conditional  and  unconditional  densities  of  x  given  by  f(_x|w^) 
and  fU),  respectively.  In  this  situation  It  can  be  shown  that  the  Bayes 
decision,  l.e.,  that  which  minimizes  the  expected  loss.  Is  described  by: 


Decide  d^(w  *  wj)  If 


Pr[W 


and  d2  (W  *  w2)  If 


wi|x] 


*21 

*21  *  t\2 


*0  say 


Pr[W 


w2|x] 


•£•21 

*21  +  *12 
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The  posterior  probability  PrCW  *  wjJ  x]  Is  given  by: 

pf(x|*i) 

pf  ( x  |wj)  +  (1  -p)  f ( x  j*2 ) 

Substituting  one  obtains  the  rule:  decide  dj  If: 

f(x|*n)  £0(1_P) 

-  >  -  a  y  say. 

f(x|v<2)  p(l-£0) 


3.0  EXAMPLES 


Consider  the  case  when  the  components  of  J  are  Independent  and 
normally  distributed.  Suppose: 

1 


fl(Xi|Wi) 


VTrroj 


exp  | 


V 


2y 


and 


f.ix.ln)  •  -L 

>/2vo.\ 


2  of 1 


Then  from  the  previous  examples  we  have: 


f  (_x  lw^ ) 


f(_x  lv*2 ) 


exp  {  - 


xj*-  (x^-u^)^ 


2o4  2 


or  select  dj  If 


exp 


Talcing  logarithms  and  rearranging  terms: 

,2 


^  U1X1 

£  9 


r1 


»»r 

£  — -2 
2^2 


In  v 


This  expression  bears  some  resemblance  to  the  BETA  algorithm  and  It  Is 
worth  considering  the  Impact  of  the  BETA  procedure  for  missing  data. 
First,  note  that  If  some  value,  say  xj.  Is  missing,  the  new  decision 


ui< 


rule  Is  select  dj  If  u^x^ 

1^3  ®|^  1Aj  2o]  2 


In 


Now  consider  a  normalization  of  the  coefficients.  In  the  first  case  by 
normalizing  we  obtain: 
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If  element  xj  Is  missing,  renormalization  of  the  coefficients  yields 
for  the  left  hand  side 


Note,  however,  that  renormalization  Is  only  consistent  If  an  appropriate 
change  is  made  to  the  right  hand  side  of  the  Inequality  both  In  terms  of 

PI* 

the  sun  £ - -  and  the  normalizing  factor. 

2oj 2 


Now  suppose  that 

1 

fl  (  X1 J  Wj  )  *  -  l  1  <  X-f  <  U{ 

“1-*  1 

*  0  el  sewhere. 

In  this  case  the  x-j  are  uniformly  distributed  on  the  Interval  [£^,u-j] 
bearing  some  similarity  to  the  BETA  approach.  Then  the  decision  criterion 
based  on  x  Is: 
select  dj  If 


f(_x|w2)  (1—^o^P 


Note  that  If  for  any  1 

X1  <*1 


*  Ysay 


or  Xi  >.  u1 

then  d}  Is  rejected.  If  this  Is  not  the  case  an  appropriate  threshold 
can  be  set. 
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Consider  the  expression: 

.  p(l-V 


-1 


« y  jsay 


Again  assune  that  ff(xj|w2)  Is  a  normal  density  with  mean  and 
variance  aj2.  Substituting: 


n  ( 2  TTO|  2)1/2 

(Xi-Uf)2 


exP  -E 


(x^-u^)2 
2p*  2 


<  Yi 


or 


2a1 ' 


>  -  InY  j  -  ln(  n  (z™^2)!#), 


(X^-Uf)2 

Provided  the  figures  of  merit  were  In  fact  the  -  ,  then  the 


coefficients  could  be  considered  to  be  The  same 

normalizing  problem  as  previously  discussed  exists,  namely  one  must  not 
only  normalize  but  also  adjust  the  decision  threshold. 

We  turn  now  to  the  case  of  dependence  between  the  components  of  the 
decision  vector  x.  For  a  variety  of  reasons  this  is  likely  to  be  the 
case,  particularly  for  the  signal  parameters.  We  use  the  same  example, 
namely  decide  d^  if: 

f  Ot  Iwj. ) 

- —  >  if  0 

f(x  |W2) 

As  before  assune: 

f(xlwi)  Is  N(0,£) 

and  f(j(|w2)  Is  N(u_,£) , 

with  non-zero  correlations,  l.e., 

£.  (*  PVA 

\  p0l02  °22  J 

where  °i2  and  o22  *re  the  variances  of  x^  and  x2  and  p  ,  the 


correlation  coefficient.  Is  non-zero. 


Substituting,  the  criteria  becomes: 


exp  ( -  -  xt£“1x 


exp  1  (x-  u_)T  £ "l (x-u)^ 


>  VO 


1  T  i  1  r  , 

or  —  xt£-*x  +  — ( x- u^)  (  X".r)  >  ^  yq 

or  ( x-j)T2_1  U-ji)  -  xT£"l£  >  21n  Yo- 
This  Is  equivalent  to: 

xT£‘1j±  <  -a  ‘  lnY0* 

As  In  the  case  of  Independence,  this  criteria  can  be  expressed  as  a 
linear  discriminant.  In  particular,  for  m2,  the  coefficients  are: 

02^1  *  P12al°2'12 


xi 


x2 


°12o22(1-p122) 
“p12°la2lll  +  °12°2  ) 


j  ffl2022d-p122) 

where  p^j  are  correlation  coefficients, 
and 

*2  2pi2uiP2 


tix-h  * 


v  V 


u2 


(l-P122)oi2  aio2^“P122)  o22(1*p122) 

For  m3  the  coefficient  of  x^  becomes: 

(1t3  232)o22°32'11  +  <Y102°32(P13P23“P12^2  +  oio22o3^Pl2P23"P13^3  ^ 
ffl2<J22<^2^“P122*PI32"P232  +  2P12PI3  +  2p12p23  +  2p13p23^ 
and  for  X2 

°10  2°32(p13P23"P12^1  +  {1“P132)pI2o32v2  +  P12P2®3^Pi2P13“P23V3  ^ 
oi2022°32(l-Pi22  “P132"P232  +  2P12P13+2P12P23  +  2P13P23^ 
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While  In  the  case  of  Independence,  provided  the  threshold  mss  adjusted, 
normalization  made  sense,  a  similar  conclusion  cannot  be  made  for 
dependence  among  the  random  variables,  as  can  be  easily  seen  by  adding 
the  coefficients  for  xj  and  X2  In  both  cases  and  then  dividing  the 
coefficients  by  the  resulting  sun  and  comparing  the  results  to  a  similar 
operation  when  n*2.  Thus,  the  following  observation:  decision  rules 
exist  such  that  subject  to  alternation  of  the  decision  threshold, 
normalization  to  account  for  missing  data  can  be  Justified  If  the  various 
components  of  the  decision  variable  x  are  Independent.  If,  on  the  other 
hand,  the  components  are  dependent  or  correlated,  normal  Izatlon  cannot  be 
J  usti  f  1  ed . 
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4.0  TINE 

In  the  discussion  up  to  this  point,  no  mention  has  been  made  of 
time.  Given  any  two  candidates  for  self-correlation,  the  time  first 
observed  and  time  last  observed  are  available.  Suppose  for  the  two 
candidates  these  times  are: 

Candidate  1:  tjo  t20 

Candidate  2:  t^  t2i  • 

The  values  of  these  four  variables  give  rise  to  three  cases: 

overlap,  1st  candidate  first; 

overlap,  2nd  candidate  first; 

overlap,  1st  candidate  contains  2nd; 

(2)  +  \  and 

•overlap,  1st  candidate  contained  in  2nd; 

no  overlap,  1st  candidate  first 

no  overlap,  2nd  candidate  first 

The  analyst,  decision  maker,  or  in  this  case,  algorithm,  can  and  should 

distinguish  among  these  three  general  cases.  First,  all  other  things 

considered,  the  prior  probabilities  for  the  first  two  probably  remain  the 

same,  as  do  the  conditional  distributions  or  likelihood  functions.  In 

fact.  In  the  extreme,  e.g.,  simultaneous  observation  of  an  emission  by 

different  sensors,  differences  In  reported  data  should  only  be  due  to 

variations  In  sensor  performance  and  not  to  such  controllable  differences 

as  selection  of  frequency,  etc.,  whereas.  If  reports  are  separated  in 

time,  the  conditional  probabilities  f(x|w^)  may  be  more  diffuse.  Note 

that  the  location  estimates  are  significant  In  this  case:  e.g..  If  the 

reports  should  be  correlated,  the  location  should  be  the  same.  This  Is 


not  the  case  If  the  reported  observation  Intervals  do  not  overlap.  For 
simplicity,  consider  four  cases,  distinguished  by  whether  or  not  location 
estimates  are  "close"  and  whether  or  not  signal  parameters  are  "close." 

If  location  estimates  are  not  close  and  signal  parameters  are  not  close, 
self-correlation  Is  probably  Inappropriate.  If  location  estimates  are 
close  and  signal  parameters  are  not  close,  the  possibility  of 
"co-located“  emitters  must  be  considered*  or  the  possibility  that  a  new 
emitter  has  replaced  the  original.  Finally,  If  location  estimates  are 
not  close  but  signal  parametrlcs  are  consistent,  the  time  and  distance 
relationships  must  be  considered,  l.e.,  the  potential  displacement  must 
be  consistent  with  teardown,  set-up,  and  travel  times.  The  implications 
in  terms  of  the  simple  Bayesian  example  are  described  as  follows.  For 
cases  1  and  2  above  the  analyst  Is  likely  to  retain  the  same  prior  and 
likelihood  functions.  If  locations  are  close,  the  signal  parametrlcs 
will  decide;  If  signal  parametrlcs  are  close,  locations  will  decide.  As 
time  elapses  without  a  report,  the  probability  that  the  entity  has 
remained  In  the  same  location  will  decrease,  as  will  the  probability  that 
Its  signal  parametrlcs  remain  constant.  For  the  sake  of  Illustrate  n  let 
the  time  required  for  a  typical  displacement  be  At  and  the  distance  Ad. 
Further,  suppose  that  the  elapsed  time  Is  t,  and  that  x  measures  location 
difference  (xj)  and  signal  parametric  difference  (X2).  Then: 
f  (xi,X2|W})  may  be  Independent  of  t 
f  (xi,X2|*2)  “  0  If  At>t 
f  (xi,X2|*3)  may  be  Independent  of  t 

*For  example,  the  LONGTRACK  radars  at  an  SA-6  regimental  headquarters. 
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where  w^  *  same  entity,  same  location,  same  parametric s 
*2  *  same  entity,  new  location,  same  parametrlcs 
W3  *  different  entity,  different  location,  same  or  different 
parametrlcs. 

As  an  Illustration  of  the  characteristics  of  the  problem  of  time, 
consider  a  simplified  situation  with  three  decisions: 
d}'.  correlate:  same  entity,  same  location; 
d2^  correlate:  same  entity,  new  location;  and 
d3:  do  not  correlate:  different  entitles,  different  locations. 
Then  It  can  be  shown  that  the  same  form  of  decision  criteria  can  be 


derived,  namely  select  decision  dj  If  the  posterior  probability  of  w^ 

Is  greater  than  a  threshold  Yj,  or 

f(xiwi)pi 

_ ! _  >  Y  1 

f(x|wi)pi  +  f(x|W2)p2  +  flx|w3)p3 

Two  elements  are  significant  here.  First,  In  the  absence  of  reports 
(and  perhaps  given  the  knowledge  of  dwell  time  and  likelihood  of  observa¬ 
tions)  the  prior  probability  p^  will  decrease  and  the  prior  probability 
P2  will  increase.  Second,  the  likelihood  f(jt | w3 )  depends  upon  the 
time  between  observations.  In  particular,  let  X}  be  a  measure  of  the 
distance  between  reported  locations  at  times  t3o  and  t^,  with 
t  ■  tn*t20*  Then 


f(j(|W2)  *  f(w2jjt)  .  -■  - 

If  t  <  as  +  a £,  (tear  down  plus  set  up  time) ,  then 

f(w2|£)  ■  0 

and  rtx.|*2^  *  °* 
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Similarly,  If  As  ♦  At  <  t  <  xj  ♦  A$  *  At 

v 

where  v  Is  a  typical  travel  speed,  then 

ffwelx)  «  0 

and  fOilwg)  *  0. 

Finally,  If  t  >  V’At+ii  *^er«  d  Is  a  typical  dlsplacenent 

v 

distance,  suppose  that  f(xlwg)  Is  non-zero.  Now  consider  the  decision 
dj.  For  dj  we  have: 

Plf(xlwi) 

pjf  ( x  Iw^ )  +  P2f(xlw2)  +  P3f(xl*3) 

For  t  <  s+  t+d_,  this  expression  Is 
v 

pjf(x  Iwj) 

■—  ■  ■  -  ,  - — -  >  Y  i 

p  jf  ( x  Iwj  >  +  P3f(xlw3) 

For  t  >  V'V'il*  the  expression  becomes 
v 

pjf  ( xl  wj ) 

- j - 1 - 1 -  >>  Y  i 

Plf(xlwi)  +  P2f(xlw2)  +  P3f(xlw3) 

Note  that  In  the  BETA  algorlthn  time  Is  used  to  derive  a  figure  of  merit. 
While  all  other  figures  of  merit  are  random  variables,  time  Is  not. 
Effectively  It  Is  known  with  certainty.  Even  In  this  very  simplified 
exanple  It  Is  clear  that  Inclusion  of  time  In  the  decision  criteria, 
treated  as  a  random  variable.  Is  Incorrect  unless  collection  management 
and  tasking  are  also  considered  In  which  case  random  variation  associated 
with  detection  and  signature  generation  must  be  considered  as  well. 


5-1 


5.0  TYPE  I  AND  TYPE  II  ERRORS 

In  the  preceding  sections  a  Bayesian  structure  was  utilized  to 
examine  simple  algorithms  and  decision  criteria  fOr  sel f- correlation. 

This  structure  Is  attractive  because  for  a  n ember  of  reasonably  general 
distribution  functions  It  provides  computationally  simple  decision 
criteria.  Given  that  such  dl strlbutlons  were  used  as  approximations  to 
actual  distributions  (for  reasons  of  clarity  and  or  ease  of  use)  It  Is 
likely  that  the  resulting  algorithms  would  be  close  to  optimal.  Aspects 
of  the  BETA  problem  are  difficult  and  complex,  particularly  the  nature  of 
the  decisions  themselves  and  the  prior  probabilities.  Nevertheless,  the 
structure  does  offer  Insights  Into  various  algorithm  attributes.  For 
example,  consider  the  Issue  of  type  I  and  type  II  errors.  For  the  simple 
problem  with  two  alternate  ",  the  decision  rule  derived  was  to  select 
d^  If: 

f(x|wi)  -d0^"P^ 

_  >  - 

f(x|w2)  p(l-£0) 

Suppose  the  components  of  x  are  Independent  and  normally  distributed 
and  let 

f(x|wi)  *  N(0,£) 
and  f(j«|w2)  *  N(ti,£) 

where  /  0  \ 

E'  (•  v)  • 

For  this  example  the  criterion  was: 

uixi  u2x2  ul2  m22  /Uo(l-p)\ 

o  i2  a22  2oi^  2a22  \P(l“£oW 
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The  probability  of  a  type  I  error  in  this  case  Is: 

T  ulxl  VZXZ  ul2  VZZ  “1 

Pr  L  °i2  +  a22  >  +  5?*  n\a-A,)p)  /j  1  J  ' 

and  the  probability  of  a  type  II  error  Is: 

l“ul*l  “2*2  ul2  u22  /(Vl-P)\|  1 

Pf  L'^2  +  “^2  <  2^2  +  2^2*  ,n\p(l^orj  I  W2J 

In  this  example,  conditional  on  W^: 


and 


wi£l 

t 


u«  2 


W2I2 


Thus,  vi jX_^  V1X9 

+ 


0 12 


a<>2 


^2  11^2 


o22 


and  the  probability  of  a  type  I  error  is  given  by: 


T  (  ul2  +  u22 
1-01  — *+— 5-ln 

I  \  \  2°1 2  2°2 


where  ^is  the  cumulative  normal  distribution  function.  Similarly,  the 
probability  of  a  type  II  error  Is  given  by: 

wl2  uz'‘  _  AoU-P)\\/Al2  u22^1/2 


2°i2  2o22  ‘  ^pd-^) Jjj \oxc  or 

A  similar  procedure  can  be  carried  out  for  any  decision  rule  derived  via 
a  Bayesian  structure  and  with  modification  for  any  other  structure  In 
trtilch  the  conditional  distributions  ftjclwj)  are  specified.  The 
critical  factor  Is  that  this  specification  for  a  combat  scenario  Is  not 
simple.  Moreover,  a  range  of  scenarios  probably  are  required. 


6.0  FIGURES  OF  MERIT  ON  THE  (MIT  INTERVAL 


To  examine  the  BETA  algorithm  as  currently  Implemented  first  note 
that  all  the  figures  of  merit  lie  on  the  unit  Interval.  To  accomplish 
this,  certain  transformations  have  been  made.  For  example,  the  measure 
of  separation  is  the  complementary  cunulatlve  distribution  of  the  randan 
varlabl  e  where  x  is  the  random  vector  of  differences  in 

location.  The  remainder  of  the  statistics  used  can  be  described  as 
fol  1  ows . 

Let  S  be  a  measured  difference  and  suppose  S  has  density  f(s!wj). 
then  define  a  variate  X  by: 


H 

Ixl 

0<S<£ 

O 

ft 

Ixl 

u<s 

u-s 

X  *  — 
U-£ 

£<S<u. 

L 

Pr[£  *  ll  w^]  * 

f  f(slv^)ds 
Jq 

Pr[7  ■  0|  w-f  ]  » 

f  f(slv^)ds 
J\x 

and  Pr[Y£x,x+dx]  •  f(u-(u-£)xlv^)dx  ^£s<u. 

Thus,  the  probability  that  £  exceeds  some  threshold  h  say  Is: 
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For  any  linear  combination  M»  of  such  variables  It  Is  possible 

to  determine  the  probability  distribution  of  M  conditional  on  wj  and 
thus  to  determine  the  type  I  and  type  II  error  probabilities.  Consider 
an  example  In  trfrfch 

f(s  «i)  *  w^e’wl8 

and  f(s  wg)  * 

Then 

f(s|wf) 

f(s|wz)  >  Y° 

Is  equivalent  to 

~e’s(vV  >  Y° 

w2 


or 


or 


-s{vrw2)  >  lnv0  - 
*  (,n  ■ 


The  two  decision  rules  are  equivalent  If  s*  i ,  x*h,  or 


Y1 


u-i 


In  this  case  the  probability  of  a  type  I  error  Is: 

PrCs>Yj  |w^]  ■  e  U1Y1, 

and  the  probability  of  a  type  II  error  Is: 

Prts^jJwj]  «  l-e”U2Yl 

If  the  rules  are  not  equivalent,  then  for  the  criterion: 

x  >  h 


u-s 

we  have  -  >  h 

u-4  ” 

or  s  <  u-h(u-£). 


i 


Hence: 


Prtx<l.|wl]  -  e 

and  Prtx^h| wj]  *  1-a 

The  BETA  algorithms  make  use  of  a  linear  combination  of  figures  of 
merit  taking  values  on  the  unit  Interval.  As  with  any  algorithm  (again 
provided  a  scenario  Is  specified)  probabilities  of  type  I  and  type  II 
errors  can  be  determined.  As  an  Illustration  consider  the  following 
example: 

Let  M  ■  ajXj  +  33*2 

and  ceelate  If 


Where 


Assune  that: 


and 


M  >_  h, 
al  +  *2  *  1 


0  <  X1  <  1 

1  «  1,2. 

ftxil  wi>  *  1 

0<xi<l 

f(x1|w2)  -  1 

0<Xj<1 

f ^  x2  |W2 )  • 

l-e"ul 

0<x2<l  • 

The  assunptlon  of  uniform  distributions  conditioned  on  W*w^  Is  not 
unreasonable  because,  as  noted  earlier  by  choice  of  transformation,  any 
random  variable  can  be  mapped  Into  an  equivalent  that  has  such  a  distri¬ 
bution.  In  particular,  the  separation  figure  of  merit  exp(-d/2)  has  a 
uniform  distribution  for  the  case  In  which  locations  are  the  same.  The 
distribution  assimed  for  W*w2  has  the  effect  of  making  values  close  to 
zero  more  probable.  Exhibit  $-1  Illustrates  the  resulting  type  I  and 
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type  II  errors  for  u  »  5.0.  The  example  shows  the  contribution  of  the 
non-uni formly  distributed  figure  of  merit:  as  more  weight  Is  given  to 
X£>  It  becomes  possible  to  achieve  lower  values  of  the  probabilities  of 
both  type  I  and  type  II  errors.  This  will  be  true  In  general.  I.e.,  for 
any  particular  set  of  weights  It  Is  possible  to  determine  the  curve 
relating  the  two  probabilities  as  a  function  of  the  decision  threshold  h. 
The  functions  required  to  carry  out  the  calculations  are  the  conditional 
distributions  of  the  decision  vector.  Ideally  one  wishes  to  set  thres¬ 
hold  values  for  desired  error  probabilities  on  the  lowest  such  curve, 
I.e.,  the  weights  defining  that  curve  would  specify  the  best  "BETA" 
algorithm,  although  not  necessarily  the  best  algorithm  possible.  Note 
the  Implications  of  renormalization.  If  a  data  element  Is  missing  the 
preferred  procedure  would  be  derive  appropriate  weights  by  finding  the 
lowest  curve  for  the  remaining  elements  In  the  data  vector.  Provided 
that  the  missing  element  Is  not  superfluous  this  curve  will  be  above  the 
original.  Thus,  even  If  thresholds  are  changed.  It  will  not  be  possible 
to  achieve  the  same  performance:  one  or  both  probabilities  will 
Increase.  In  a  Bayesian  context  this  corresponds  to  an  Increase  In  the 
expected  loss. 

The  examples  In  section  3.0  of  this  paper  suggest  that  If  the 
figures  of  merit  are  stochastically  dependent  the  Impact  of  renormaliza¬ 
tion  Is  to  move  to  an  operating  point  that  Is  above  the  optimal  curve  for 
the  reduced  data.  The  examples  for  both  Independence  and  dependence 
among  the  data  elements  suggest  that  the  "best"  BETA  curve  Is  above  the 
theoretical  optlmun.  The  magnitude  of  the  difference  should  be 
Investigated. 


7.0  REMARKS 


The  scope  of  the  examination  of  the  BETA  algorithms  mis  restricted 
to  those  used  for  self-correlation  with  particular  emphasis  on  EUNT. 

Two  questions  were  addressed: 

(1)  How  well  do  the  algorithms  perform?  and 

(2)  Is  the  normalization  procedure  sound? 

The  question  of  algorithm  performance  Is  closely  related  to  scen¬ 
ario.  The  Bayesian  structure  used  within  this  paper  represents  this 
relationship  by  means  of  losses,  prior  probabilities,  and  conditional 
distributions.  Within  this  structure  It  Is  possible  to  calculate,  for 
any  algorithm,  the  probabilities  of  type  I  and  type  II  errors.  In  the 
absence  of  actual  nunbers  and  a  specific  scenario,  the  procedures  were 
Illustrated  and  an  example  was  provided.  The  example  suggests  that  the 
general  concept  of  weights  can  be  explained,  l.e.,  one  wishes  to  give 
greatest  value  to  the  best  discriminants.  Note,  however,  that  all 
potential  decisions  must  be  considered  In  such  an  operation,  l.e.,  both 
“correlate"  and  "don't  correlate"  must  be  addressed.  In  the  context  of 
linear  combinations  of  figures  of  merit  taking  values  on  the  unit 
Interval  It  may  be  the  case  that  the  probability  of  a  type  I  error  Is  not 
critical;  rather  the  algorithms  have  been  designed  and  weights  assigned 
to  minimize  the  probability  of  a  type  II  error. 

The  question  of  normalization  was  examined  by  constructing  linear 
discriminants  and  examining  the  Impact  of  missing  data.  In  the  case  of 
Independent  figures  of  merit,  a  case  can  be  made  for  the  normalization 
procedure  provided  that  the  threshold  Is  changed  appropriately,  but  It  Is 
unlikely  the  case  holds  for  the  figures  of  merit  defined  on  the  unit 


i 
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Interval.  In  the  case  of  dependent  figures  of  merit  this  Is  not  true  and 
normalization  cannot  be  justified.  Since  It  Is  likely  that  the  figures 
of  merit  are  dependent,  different  procedures  should  be  adopted  to  account 
for  missing  data. 

The  treatment  of  temporal  data  In  the  current  BETA  algorithms 
appears  to  assume  that  times  are  a  random  variable.  Given  knowledge  of 
collection  management  and  sensor  tasking  such  an  assumption  might  be 
appropriate,  but  It  does  not  appear  that  this  Is  the  case  In  BETA.  A 
general  approach  to  the  Issue  Is  provided  In  section  4.0  In  which  both 
priors  and  conditional  distributions  are  made  functions  of  time  between 
reports . 

Currently,  results  from  tests,  etc.  are  the  only  means  of  assessing 
algorithm  performance.  Results  with  which  we  are  familiar  suggest  that 
the  current  versions  of  the  algorithms  forego  self-  correlation  to  avoid 
Incorrect  correlation,  1.e.,  type  II  errors.  All  other  things 
considered,  the  BETA  alorlthms  should  do  reasonably  well  against 
stationary  ELINT  targets.  For  a  varelty  of  reasons.  Including  the 
treatment  of  timing,  against  targets  which  displace,  this  Is  unlikely  to 
be  the  case.  Nevertheless,  given  the  distances  separating  most  ELINT 
targets  (excluding  GUNDISH  radars)  location  differences  should  be  a 
powerful  discriminant.  In  this  respect  It  may  be  worthwhile  to  use  an 
an  alternative  approach  and  enhance  the  distance  figure  of  merit 
exp(-d/2)  by  explicitly  considering  alternate  decisions. 
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1.0  INTRODUCTION 

The  purpose  of  this  paper  Is  to  describe  a  methodology  for  evaluat¬ 
ing  the  self- correlation  function  of  BETA  correlation.  Briefly  stated, 

BETA  self-correlation  Is  an  aid  In  deciding  If  two  reports  describe  the 
same  entity  or  not.  The  basic  approach  of  BETA  Is  to  calculate  statis¬ 
tics  or  flgures-of-merlt  based  on  the  contents  of  two  reports,  determine 

a  weighted  sum  of  the  statistics  to  generate  an  overall  statistic,  and 
then  compare  the  value  of  the  statistic  to  a  preset  threshold.  If  the 
statistic  attains  or  exceeds  the  threshold  then  the  reports  are  corre¬ 
lated,  l.e.,  they  are  said  to  describe  the  same  entity;  otherwise,  they 
are  uncorrelated,  l.e.,  they  are  said  not  to  describe  the  same  entity. 

The  remainder  of  this  paper  Is  divided  Into  five  chapters.  Chapter 
2.0  presents  a  problem  description  giving  the  context  and  concepts 
required  In  the  discussion  of  the  evaluation  methodology.  Chapter  3.0 
follows  with  a  description  of  how  BETA  self- correlation  functions. 
Chapter  4.0  Introduces  the  evaluation  criteria  as  the  cost  of  making 
correlation  errors  and  presents  a  cost  function  that  depends  on  the 
probabilities  of  making  such  errors.  Chapter  5.0  gives  a  mathematical 
description  of  the  evaluation  methodology.  Chapter  6.0  Identifies  some 
of  the  ways  In  which  the  methodology  could  be  used  to  Improve  the 
performance  of  BETA.  An  appendix  Is  Included  which  describes  the 
statistics  of  BETA  In  detail. 
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2.0  PROBLEM  DESCRIPTION 

This  chapter  describes  the  context  and  explains  the  concepts 
required  for  understanding  the  evaluation  methodology.  First,  the  con¬ 
cept  of  a  "scenario"  Is  explained.  Next,  an  "observable"  Is  defined,  the 
measurement  of  observables  discussed,  and  the  resulting  reports  are 
described.  Following  this,  the  function  of  self-correlation  Is  discussed 
and,  lastly,  the  Interpretation  of  the  reports  after  sel f-correlatlon  Is 
covered  for  completeness  sake. 

2.1  SCENARIO 

The  evaluation  of  an  aid  like  BETA  requires  a  knowledge  of  the 
various  situations  In  which  It  may  have  to  operate.  The  need  for  this 
knowledge  Is  twofold.  First,  the  value  of  the  aid  Is  typically  sensitive 
to  the  Importance  of  the  desirable  and  undesirable  events  that  might 
occur  and  their  tendencies  to  occur.  For  example.  If  the  Importance  of 
the  possible  outcomes  of  a  situation  Is  Inconsequential  or  If  all  out¬ 
comes  are  equally  acceptable  except  for  one  undesirable  outcome  and  the 
tendency  for  Its  occurence  Is  negligible,  then  the  aid,  regardless  of  Its 
performance.  Is  of  little  value.  The  Identification  of  the  desirable  and 
undesirable  events  of  a  situation  will,  to  some  degree,  define  the  situa¬ 
tions  In  which  the  aid  may  have  to  operate.  Second,  the  need  to  describe 
how  the  aid  will  operate  necessitates  a  description  of  Its  response  for 
any  given  situation.  Then,  given  the  situation,  we  can  characterize  Its 
response  and  measure  Its  performance.  The  description  of  the  environment 
or  conditions  under  which  the  aid  Is  to  operate  Is  called  a  scenario.  In 
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summary,  a  scenario  Is  a  description,  over  time,  of  all  events  that  play 
a  role  In  the  activities  or  performance  of  the  aid. 

Here,  the  aid  Is  the  BETA  self-correlation,  which  Is  concerned  with 
the  correlation  of  reports  sunaarlzlng  detections  made  on  electromagnetic 
emissions  such  as  radar  and  radio.  Some  Items  of  a  scenario  could  be  the 
Identity  of  detectable  emitters,  where  they  are  located,  what  detectable 
emissions  are  made,  what  sensors  are  used,  what  detections  are  made  by 
the  sensors,  what  are  the  conditions  under  which  the  detections  were 
made,  where  are  the  sensors  located,  and  so  on,  all  given  as  a  function 
of  time.  Once  the  scenario  has  been  described,  the  expected  performance 
of  BETA  can  be  specified. 

2.2  OBSERVABLES  AND  MEASUREMENT 

As  Implied  In  section  2.1,  sensors  are  used  to  collect  Information. 
The  physical  characteristics  of  phenomenon  detected  by  a  sensor  are 
called  observables.  The  observables  of  electromagnetic  radiation  used  in 
BETA  are  frequency,  pulse  width,  pulse  repetition  interval,  location  of 
the  source  (possibly  derived  from  lines- of- bearing) ,  and  times  of 
observation. 

When  a  sensor  detects  an  emission.  It  results  In  a  measurement  of 
one  or  more  observables.  Normally  the  estimated  value  and  actual  value 
of  the  observable  differ  by  some  unknown  amount.  The  degree  to  which 
they  agree  Is  a  measure  of  the  performance  of  the  sensor.  The  probabil¬ 
istic  description  of  how  well  a  measurement  might  agree  with  reality 
defines  the  sensor  capability. 
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Given  the  scenario,  the  probabilistic  nature  of  the  statistics  cal¬ 
culated  In  BETA  depends  completely  upon  sensor  performance.1-  In  gen¬ 
eral,  the  more  a  measurement  may  deviate  fran  the  actual  value,  the  more 
a  dependent  statistic  will  vary  .n  Its  value.  On  the  other  hand.  If  the 
sensors  are  perfect  (measured  value  and  true  value  agree),  then  the 
dependent  statistics  are  deterministic.  The  study  of  the  statistics  In 
BETA  will  rely  primarily  on  the  description  of  a  sensor's  capability  to 
measure  observables. 

2.3  REPORTS 

A  report  Is  simply  a  sunmary  of  the  measurements  made  by  one  or  more 
sensors  on  a  single  element  during  the  same  observation  time.  Reports 
are  the  Items  correlated  during  sel  f-correl atlon.  Ideally,  a  report 
should  describe  exactly  one  entity;  otherwise,  the  concept  of  "errors"  In 
correlation  becomes  complicated.  For  example.  If  report  A  describes  two 
unique  entitles  a  and  b  as  though  they  were  one  entity  and  report.B 
describes  entity  b,  then  Is  it  correct  to  correlate  A  and  8  or  to  uicor- 
relate  them?  Unfortunately,  It  may  be  necessary  to  merge  two  correlated 
reports  Into  one  report  to  reduce  the  mmber  of  reports  to  retain.  This 
may  be  necessary,  for  example,  when  reports  are  stored  In  a  computer  data 
base  with  a  snail  memory  capacity.  A  report  that  represents  the  merging 
of  two  or  more  reports  Is  called  an  updated  report. 


^Assuming  that  all  measurements,  once  made,  remain  unaltered. 
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2.4  SELF-CORRELATION 

The  purpose  of  sel f- correlation  Is  to  declare  a  belief  or  disbelief 
that  two  reports  describe  the  sane  entity  or  emitter.  This  process  aids 
In  resolving  the  grouping  or  fusion  problem  of  deciding  what  set  of 
reports  describes  the  sane  entity,  thereby  obtaining  a  fuller  description 
of  the  entity  and  avoiding  confusion  and  erroneous  conclusions  about  the 
entity  due  to  Improper  grouping. 

The  general  approach  BETA  uses  In  deciding  If  two  reports  correlate 
or  not  Is  to  calculate  a  statistic  based  on  the  types  of  Information  com¬ 
mon  to  both  reports  and  then  compare  the  result  to  a  predetermined  thres¬ 
hold.  If  the  threshold  Is  attained  or  exceeded  then  the  reports  are  cor¬ 
related;  that  is,  they  are  declared  to  describe  the  same  entity.  Other¬ 
wise  they  are  uncorrelated;  that  is,  they  are  declared  to  describe  separ¬ 
ate  entitles. 

Envisioning  the  reports  to  be  stored  In  a  data  base  of  a  computer, 
the  effect  of  self-correlation  can  be  described  as  changing  a  relation¬ 
ship  between  two  reports  from  an  "unknown"  status  to  either  "correlated" 
or  "uncorrelated."  If  there  are  no  "unknown"  relationships,  then  the 
data  base  has  been  fully  examined  and  Is  ready  for  further 
Interpretation. 

2.5  IHTERPRETATION 

For  completeness,  the  step  following  the  sel f- correlation  function 
Is  briefly  discussed.  Once  the  reports  have  been  sel f- correlated,  the 
problem  of  determining  vrfiat  reports  describe  the  same  entity  has  been  at 
least  partially  solved.  The  problem  may  be  not  completely  solved  because 
a  chain  of  correlated  reports  may  not  all  correlate  with  one  another 
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(e.g.,  A  collates  with  B,  B  correlates  with  C,  but  A  does  not  correlate 
with  C).  This  problan  Is  avoided  when  all  correlated  reports  are  merged 
Into  updated  reports,  but  at  the  expense  of  propagating  any  correlation 
errors  made  earlier  In  the  history  of  an  updated  report.  Assunlng  such 
problems  are  somehow  resolved,  we  are  left  with  reports  that  have  been 
grouped,  hopefully,  to  provide  the  highest  degree  of  entity  description 
the  reports  have  to  offer. 

The  next  step  Is  to  collect  groups  of  correlated  reports  and  examine 
than  to  create  more  complex  relationships  among  than  that  Imply  organiza¬ 
tional  structures,  deplo>ment,  and  missions.  The  BETA  correlation 
attempts  to  aid  In  this  interpretation  effort,  but  BETA  correlation 
beyond  sel f- correlation  Is  not  considered  here. 
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3.0  SELF -CORRELATION  IN  BETA  CORRELATION 


The  purpose  of  this  section  Is  to  describe.  In  same  detail,  how 
self- correlation  Is  done  In  BETA. 

There  are  two  previously  uncorrelated  reports  selected  fbr  correla¬ 
tion:  one  report  Is  called  the  "suhject"  and  the  other  called  the 
"candidate."  The  perception  In  BETA  has  the  subject  as  a  new  report  just 
received  and  the  candidate  as  an  old,  possibly  updated,  report  retrieved 
from  a  data  base  of  reports. 

The  estimates  of  each  observable  common  to  both  reports  Individually 
compared  and  a  statistic  for  the  type  of  observable  calculated.  For 
example,  suppose  both  reports  have  an  estimate  of  frequency.  Let 
u  *  frequency  estimate  of  suhject,  and 
Uc  *  frequency  estimate  of  the  candidate. 

The  frequency  statistic  Is  defined  as  follows: 


V2 


1 

^  |  U  -  Uc  |  -  *2 

^  *  *2 

0 


,  |  U  -  Uc  |  <  *2 
.  *2  <|U  -Uc|<  l* 
,  otherwise. 


where  and  mg  are  arbitrary  thresholds. 

If  U  Is  1  MHz,  Uc  Is  1.1  MHz,  *2  1s  -05  MHz,  and  a£  Is  .2  MHz, 
then  the  frequency  statistic  has  a  value  of  2/3.  The  statistics  for  the 
other  observables  (location,  pulse  width,  pulse  repetition  Interval, 
and  observation  times)  are  given  In  appendix  A.  Note  that  the 
statistic  varies  between  zero  and  one  and  Increases  as  the  absolute 
difference  of  the  measured  frequencies  decreases.  This  general  behavior 
Is  common  to  all  the  remaining  statistics. 


-  i 
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An  overall  statistic  Is  formed  as  a  weighted  sun  of  the  Individual 
statistics  calculated.  Let  H  be  the  overall  statistic  and  Vj  the 
statistic  corresponding  to  observable  type  1<  Then, 

I>v1 

lei 

H  »  -  . 

2>i 

i«i 

Here,  I  Is  the  set  of  i  where  observable  type  i  has  been  estimated  in 
both  candidate  and  subject,  and  d-f  Is  a  weighting  factor  assigned  to 
observable  type  1. 

The  statistic  H  is  then  compared  to  a  predetermined  threshold  h.  If 
H  attains  or  exceeds  this  threshold,  then  the  two  reports  are  correlated; 
otherwise,  they  are  uncorrelated.  Note  that  the  result  cannot  be 
"unknown" ,  which  was  the  status  before  the  reports  were  correlated. 

Although  not  an  Important  assunptlon,  it  Is  assuned  that  the  thres¬ 
hold  h  exists  Independently  of  the  contents  of  the  set  I  defined  above. 

If  h  Is  allowed  to  depend  on  I,  then  there  Is  no  real  need  to  renormalize 
the  weights  by  the  division  performed  above  because  new  thresholds  could 
be  defined  by  multiplying  them  by  the  renormal  izatlon  factor.  However, 
the  effect  of  renormalization  Is  to  restrict  H  to  the  Interval  fron  zero 
to  one  which  may  be  a  desirable  characteristic. 
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4.0  EVALUATION  CRITERIA 

This  section  develops  a  mathematical  expression  for  evaluating  the 
performance  of  the  BETA  self- correlation  function.  The  assumptions  lead¬ 
ing  to  the  evaluation  criteria  are  first  Identified,  then  the  expression 
Is  given  and  discussed. 

Typically,  the  way  to  Investigate  the  performance  of  an  aid  like 
BETA  Is  to  study  the  types  of  errors  It  may  make  and  the  tendancy  to  make 
them  under  various  conditions  or  scenarios.  In  BETA  there  are  two  types 
of  errors  that  can  be  made:  type  1  and  type  2.  A  type  1  error  occurs 
trfien  two  reports  are  not  correlated  and  they  describe  the  same  entity.  A 
type  2  error  occurs  when  two  reports  are  correlated  and  they  describe  two 
unique  entitles.  To  specify  the  performance  of  self-correlation  for  a 
given  scenario  and  correlation  event  Is  to  give  the  probability  of  making 
a  type  1  and  type  2  errors.  The  concept  of  a  “correlation  error"  becomes 
more  complicated  when  one  of  the  reports,  l.e.,  the  candidate,  of  a  cor¬ 
relation  event  Is  an  updated  report  and  conceivably  describes  more  than 
one  entity.  If  the  candidate  report  describes  two  or  more  entitles  as 
one  entity,  then  correlating  or  uncorrelating  It  with  a  subject  report 
results  In  one  or  more  type  2  or  type  1  errors,  respectively.  For  exam¬ 
ple,  suppose  reports  A  and  B  describing  entitles  ea  and  e^,  respec¬ 
tively,  are  merged  Into  a  candidate  report  C.  Further,  suppose  report  S 
describing  entity  e^  Is  to  be  correlated  with  C.  If  they  are  corre¬ 
lated,  the  effect  Is  to  correlate  S  with  A  and  B,  resulting  In  a  type  2 
error.  If  they  are  uncorrelated,  then  S  Is  effectively  uncorrelated  with 
A  and  B  for  one  type  1  error.  Note  that  If  the  candidate  report  Is  an 
updated  report,  then  both  types  of  errors  may  apply  for  a  given  scenario 
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and  correlation  event.  If  the  candidate  Is  not  an  updated  report,  then 
only  one  type  of  error  will  apply. 

In  the  remainder  of  this  paper.  It  Is  assianed  the  candidate  and  sub¬ 
ject  are  not  updated  reports.  The  primary  reason  for  this  assunptlon  Is 
that  an  evaluation  methodology  of  BETA  sel f- correlation  then  updated 
reports  are  present  will  depend  on  how  measurmnents  from  two  reports  are 
combined  when  creating  an  updated  report.  It  Is  not  clear  how  this  Is 
done. 

Ihe  value  of  an  aid  like  BETA  Is  reflected  to  the  extent  wdeslrable 
events  are  avoided  and  desirable  events  are  encountered  through  Its  use. 
Normally  the  value  of  an  aid  Is  the  extent  to  which  It  reduces  some  kind 
of  average  or  expected  "costs"  through  Its  use. 

We  assmne  here  that  a  functional  relationship  exists  between  cost 
.  reduction  and  performance  Improvement  of  BETA.  Thl s  means  we  can  study 
the  probability  of  correlation  errors  and,  through  a  functional  relation¬ 
ship,  determine  the  associated  costs.  If  we  make  the  additional  assunp- 
tlons  that  costs  resulting  from  a  correlation  error  are  additive  and  all 
pairs  of  reports  are  put  through  the  correlation  process,  then  we  can 
write  the  expected  cost  of  errors  In  a  given  scenario,  S,  as 

CIS).  £  C,j  C,J. 

I  t  J 

where 

Pjj(S)  *  probability  of  making  a  type  1  or  type  2  error 

(only  one  will  apply)  when  correlating  reports  1  and  j 
under  scenario  S,  and 


Cjj(S)  *  the  cost  associated  with  waking  the  error  Indicated 
by  P^tS). 

Although  this  additive  form  Is  not  necessary  for  evaluation,  the 
dependency  of  expected  costs  solely  on  the  probability  of  waking  type  1 
and  type  2  errors  Is  desirable  In  order  to  slwpllfy  the  problem  to  that 
of  determining  the  probability  of  waking  these  errors. 

To  evaluate  the  performance  of  BETA  over  a  class  or  distribution  of 
scenarios  with  cowwensurate  costs,  the  overall  expected  cost  is: 

C  -  £c(s,p(s,» 

S 

where  S  Is  suwned  over  all  scenarios  In  the  class  or  distribution,  and 
P(S)  Is  a  weighting  factor  reflecting  the  relative  likelihood  or 
probability  that  scenario  S  might  occur. 

The  degree  to  which  the  use  of  BETA  reduces  C(S)  or  C  Is  a  measure 
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5.0  PROBABILITY  OF  CORRELATION  ERRORS 

To  evaluate  BETA  self-correlation,  we  have  assuaed  a  cost  function 
dependent  only  on  the  probability  of  making  type  1  and  type  2  correla¬ 
tion  errors.  This  section  developes  a  mathematical  expression 
for  calculating  these  probabilities. 

Given  the  scenario,  everything  Is  specified  except  the  actual  meas¬ 
urements  made  by  the  sensors  which  are  probabilistic  In  nature.  This, 

In  turn.  Implies  a  probabilistic  behavior  In  making  correlation  errors. 

The  probabilistic  nature  of  a  sensor  Is  given  by  the  general  probability 

function*  f(mj,  m2,...,  ms  |  aj,  83,...,  85)  which  gives  the 

relative  likelihood  the  measured  values  of  observables  1,  2,  ....  5  Is  mlt 

m2 . ms  given  the  actual  values  are  a^,  83,  ...»  85,  respectively. 

Although  not  explicitly  stated,  this  function  maty  well  be  dependent  on  other 
aspects  of  the  scenario  (like  weather). 

The  dynamics  of  an  observable  1  from  a  particular  entity  Is  given  by 
Its  actual  value  as  a  function  of  time  or  a^(t).  The  probability  of  a 
type  I  or  type  2  error  In  correlation  Is  P(H  <  h)  or  P(H  ^  h),  respec¬ 
tively,  remembering  that  only  one  type  of  error  will  apply.  The 
expression  for  a  type  1  error  Is: 

P(H  <  h)  ■ 

g(vj,  v2,...,  v5 ) , 

vi:  divi  v»:  dov?  V5:  <tevs 

<  h*  <  h*  -  djv  <  h*  -  dtvi  -  ...  -  d4V4 


*A  general  probability  function  may  be  a  probability  density  function 
If  the  variable  Is  continuous,  a  probability  mass  function  if  the 
variable  Is  discrete,  or  a  combination  of  both. 
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where  h*  *  h£d-f  over  1  el,  I  Is  set  1  such  that  observable  1  has 
been  measured  In  both  candidate  and  subject  reports,  and  g(vi,  V2>  ....  V5) 
Is  the  joint  general  probability  function1  of  the  statistics  V},  V2»  ...,  V5 
given  the  scenario  and  the  two  reports  Involved.  Note  that  depending  on 
the  set  I,  some  of  the  sues  above  and  parameters  of  g  may  not  appear.  An 
expression  for  g  In  terms  of  the  target  dynamics,  7(t)  and  sensor(s) 
performance,  f(m  |  7(t)) ,  where  m  »  (mp  m2,  . ...  mg)  and 
7(t)  *  (a^ft) . 85 (t) )  Is: 

gfv)  ■  f(  m  |  T ( t) )  f  (1^  |  Icltg)), 

m  9  mc 

such  that  _ 

T  (m,  mc)  a  v. 

The  subscript  c  represents  values  associated  with  the  candidate, 
no  subscript  refers  to  the  subject,  and  T( m,  mc)  Is  the  vector  with 
components  Y^(m,  mc)  that  represents  the  functional  relationship  between 
statistic  Vj  for  observable  type  1  and  the  candidate  and  subject  esti¬ 
mates  of  the  observable.  Finally,  t  has  a  special  meaning  here:  It 
represents  the  Interval  of  observation  during  which  the  sensor  Is  collect¬ 
ing  Information  to  estimate  the  observables.  Note,  that  In  the  above,  the 
sensors  are  assuned  to  operate  Independently. 

The  probability  of  making  a  type  2  error  Is  simply  1  -  P (H  <  h) 
under  the  realization  that  the  two  reports  now  describe  separate 
entitles. 


^Thus  for  the  continuous  statistics  the  corresponding  suns  above  Is  replaced 
by  an  Integral  operator. 
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To  sumurlze,  mo  will  briefly  review  what  has  been  done  mathemati¬ 
cally  above. 

Ihe  criteria  for  correlating  two  reports  depends  on  the  value  of  a 
statistic  H.  Thus,  the  probability  of  making  a  correlation  error  depends 
on  the  probability  the  statistic  has  a  value  that  leads  to  the  wrong 
conclusion.  The  probability  distribution  of  the  statistic  depends  on  two 
things:  the  scenario,  which  Includes  the  system  (dynamics  and  all  thres¬ 
holds,  and  the  measurement  capabilities  of  the  sensors.  These  factors 
define  a  joint  probability  distribution  of  the  component  statistics  which 
Is  the  function  g(7).  To  determine  the  probability  of  making  a 
correlation  error,  we  simply  sum  g(7)  over  all  possible  values  of 
the  component  statistics,  which  leads  to  a  value  of  the  overall 
statistic  H  that  produces  an  erroneous  conclusion. 
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6.0  USES  OF  EVALUATION  METHODOLOGY 

There  are  a  nunber  of  ways  the  evaluation  Methodology  could  be  used 
to  Improve  the  performance  of  BETA  self- correlation  under  given  scenar¬ 
ios.  Seme  of  these  uses  are  described  here. 

There  exist  numerous  thresholds  and  weights  In  BETA  which  are  to  be 
determined  by  some  means.  Using  the  evaluation  methodology  presented 
here,  values  for  these  parameters  could  be  determined  for  a  given  cost 
function  and  scenario  or  set  of  scenarios.  The  approach  Is  obvious: 
determine  the  values  of  the  parameters  which  minimizes  the  expected  cost 
function. 

The  selection  of  better  statistics  than  those  used  In  BETA  could  be 
made  using  the  methodology  by  selecting  those  statistics  which  produce 
lower  expected  costs  for  a  given  set  of  scenarios.  In  fact,  the  method¬ 
ology  could  be  used  to  develop  optimal  statistics  In  the  sense  of  mini¬ 
mizing  expected  costs.  The  optimal  threshold  statistic  (like  the  "HM 
statistic  of  BETA)  and  optimal  threshold  value  can,  in  theory,  be  deter¬ 
mined  from  the  target  dynamics,  sensor  performance  and  cost  function.  Of 
course,  the  difficult  part  Is  characterizing  sensor  performance  and  the 
target  dynamics  under  the  scenarios  of  Interest,  and  developing  a  reason¬ 
able  cost  function. 

Finally,  the  methodology  can  be  used  to  answer  questions  concerning 
the  value  of  Information.  Simply  put,  the  expected  value  of  obtaining  an 
additional  Item  of  Information  Is  the  expected  reduction  In  costs  that 
results  In  obtaining  and  using  the  Information. 


[ 
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APPENDIX  A: 

FISURE-OF-MERIT  STATISTICS  IN  BETA  SELF-CORRELATION 

The  following  describes  the  flgure-of-merlt  (FOM)  statistics  of  the 
BETA  self-correlation  algorithm.  First,  the  variables  used  In  the  calcu¬ 
lation  of  the  FOMs  are  defined  and  then  the  mathenatlcal  expression  for 
each  FCM  Is  given.  There  are  two  reports:  the  subject  (a  new  report) 
and  the  candidate  (selected  fron  a  data  base  of  old  reports). 

Variables 

X, Y  3  location  estimate; 

U  *  frequency  estimate; 

W  3  pul se  width; 

R  3  pulse  repetition  frequency; 

a  3  first  time  entity  observed;  and 

b  3  last  time  entity  observed. 

Variables  with  no  subscript  are  estimates  of  the  subject.  Variables  with 
a  subscript  of  c,  e.g.,  Xc,  are  estimates  of  the  candidate.  A  variable 
can  be  "unknown,"  which  means  the  variable  was  not  estimated  In  the 
report.  If  a  variable  is  unknown,  a  FOM  Is  not  calculated  fbr  the 
variable. 


where 


Vi  *  e"D/2t 

£  •  Variance  -  covariance  Matrix  of  (X, Y) ; 


I 


l.. 


/  o2xx  o2xy\ 

V2xy  o^yy/ 


c  *  Variance  -  covariance  Matrix  of  (XC,YC) 


Frequency 


V2  »  <  1  - 


I  U  -  UC  1  -*2 
^  ~l2 


where  and  are  given  parameters. 


Pulse  Width 


I  H  -  Wc  |  -  *3 

»J  -  *3 


where  £3  and  m3  are  given. 


.  |U  -Uc  |<£2 
,  i2  <|U  -Ucli«6 

,  otherwise. 


.U-Wc|<£3 

.  *3  il«  -WCli"9 

,  otherwise. 


T 

I 


Pulse  Repetition  Frequency 

I  R/2  -  Rc  I 


1  - 


1  - 


l  - 


HVi  R/2 

I  R  -  *c  I 
■  4R 

I  a  -  «c  I 
2n^  R 


A- 3 


,  (1  -  **)  R/2  <  Rc  <  (1  +  m*)  R/2 


,(i  -  m|)R£Rc><(i  +  m|)R 


,2  (1  -«m  )R  _<  Rc  _<  2  (1  +  )R 


,  otherwise. 


where  104  Is  a  given  parameter. 


T  Ime 


v5  »  {  1  -  («<;  -  b)  /«5 
1  -  (a  -  bc)/n$ 
0 


,  a  ^  a^.  _<  b  or  a  ^  bc  _<  b  or 

*c  i  a  1  b  i  bc 
,  b  <  ac  <  b  +  n$ 

,  b  -  n$  <  bc  <  b 
,  otherwise 


Exhibit  A-la  through  A-le  graphically  Illustrate  the  dependence  of  on 
the  variables  defined  above. 
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EXHIBIT  A-l :  ILLUSTRATION  OF  FOMS 
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